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ABSTRACT 

In this document I develop a weight function theory of zero order basis function interpolants and smoothers. 
In Chapter 1 the basis functions and data spaces are defined directly using weight functions. The data spaces 
are used to formulate the variational problems which define the interpolants and smoothers discussed in 
later chapters. The theory is illustrated using some standard examples of radial basis functions and a class 
of weight functions I will call the tensor product extended B-splines. 

In Chapter 2 the theory of Chapter 1 is used to prove the pointwise convergence of the minimal norm 
basis function interpolant to its data function and to obtain orders of convergence. The data functions 
are characterized locally as Sobolev-like spaces and the results of several numerical experiments using the 

extended B-splines are presented. 

In Chapter 3 a large class of tensor product weight functions will be introduced which I call the central 
difference weight functions. These weight functions are closely related to the extended B-splines and have 

similar properties. The theory of this document is then applied to these weight functions to obtain inter- 
polation convergence results. To understand the theory of interpolation and smoothing it is not necessary 
to read this chapter. 

In Chapter 4 a non-parametric variational smoothing problem will be studied using the theory of this 
document with special interest in its order of pointwise convergence of the smoother to its data function. 
This smoothing problem is the minimal norm interpolation problem stabilized by a smoothing coefBcicnt. 
In Chapter 5 a non-parametric, scalable, variational smoothing problem will be studied, again with special 
interest in its order of pointwise couvcrgcnce to its data function. We discuss the SmoothOperator software 
(freeware) package which implements the Approximate smoother algorithm. It has a full user manual which 
includes several tutorials and data experiments. 
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0.1 Introduction 

In this document I develop a weight function theory of zero order basis function interpolants and 
smoothers. Note that the Appendix of this document contains a list of basic notation, definitions and 
properties also used in this doctimcnt. 

This document had its genesis in the development of a scalable algorithm for Data Mining applications. 
Data Mining is the extraction of complex information from large databases, often having tens of millions 
of records. Scalability means that the time of execution is linearly dependent on the number of records 
processed and this is necessary for the algorithms to have practical execution times. One approach is to 
develop additive regression models and these require the approximation of large numbers of data points 
by surfaces. Here one is concerned with approximating data by surfaces of the form y = f{x), where 
X e R''', y e R and is any dimension. Smoothing algorithms are one way of approximating surfaces and 
in particular we have decided to use a class of non-parametric smoothers called basis function smoothers, 
which solve a variational smoothing problem over a semi-Hilbert space of continuous data functions and 
express the solution in terms of a single basis function. 
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I started my Masters degree (supervised by Dr. Markus Hegland and Dr. Steve Roberts at the ANU, 
Canberra, Australia) searching for a scalable basis function smoothing algorithm and had the good 
fortune to devise such an algorithm (unpublished) by approximating, on a regular grid, the convolution in 
Definition 38 of the space Jq in Dyn's review article [3] . In this document I develop some theoretical tools 
to construct and analyze this algorithm for the case of positive order basis functions. In the document 
Williams [13] the zero order case is studied. This theoretical approach applies in any dimension but the 
smoothing algorithm is only practical up to about three dimensions. In higher dimensions the matrices 
are too large to put into computer memory. 

Dr. Hegland was particularly interested in using the tensor product hat (triangular) function as a basis 
function. At about this time we had a visit by the late Professor Will Light who showed me his paper 
[8] which defined basis functions in terms of weight functions using the Fourier transform. Light and 
Wayne's weight function properties were designed for positive order basis functions which excluded the 
tensor product hat function. They were designed for the well-known 'classical' radial weight functions. 
I therefore developed a version of his theory designed to generate zero order basis functions, including 
both tensor product and radial types. This theory is developed in Chapter 1 and requires that the basis 
functions have Fourier transforms which can take zero values outside the origin since this is a property 
of hat functions. 

Chapter by chapter in brief: 

• Chapter 1: The goal of this chapter is to extend the theoretical work of Light and Wayne in [8] 
to allow classes of weight functions analogous to the positive order weight functions developed in 
Chapter 1 of Williams [13]. Both the inner product data space and the basis functions are defined 
in terms of the weight function. We then prove the completeness and smoothness properties of the 
data space as well as the continuity and positive definiteness properties of the basis function. We 
use as examples the radial weight functions: the thin-plate splines, the Gaussian and the Sobolev 
spline. Special attention is paid to a class of tensor product weight functions we call the extended 
B-splines which are the convolutions of hat functions. 

• Chapter 2 studies the minimum seminorm interpolation problem. Topics include the existence 
and uniqueness of the basis function solution, a matrix equation for the solution and the pointwise 
convergence to the interpolant to its data function. Pointwise order of convergence results are 
derived using two techniques. One approach uses Lagrange interpolation techniques and the second 
does not. Numerical experiments were undertaken using the extended B-splines and special data 
functions were constructed for these experiments. This led to the extended B-splines data functions 
being characterized locally as Sobolev-like spaces. 

• Chapter 3: This chapter introduces a large new class of weight functions which I call the central 
difference weight functions because they are based on central difference operators. They are closely 
related to the extended B-splines. The central difference basis functions are calculated and smooth- 
ness estimates and upper and lower bounds are derived for the weight functions and the basis 
functions. Like the extended B-splines their data functions are characterized locally as Sobolev-like 
spaces. Pointwise convergence results are derived for the basis function interpolant using results 
from Chapter 2. These give orders of convergence identical to those obtained for the extended 
B-splines in. We do not present the results of any numerical experiments. 

• Chapter 4 studies the well-known Exact smoother problem (our terminology) which stabilizes 
the interpolant by adding a smoothing coefficient to the seminorm functional. Topics include the 
existence and uniqueness of the basis function solution, a matrix equation for the solution and the 
pointwise convergence of the solution to its data function. Orders of convergence (also called error 
orders) are derived which, in special sense, are the same as those obtained for the interpolant. 

• Chapter 5 studies the scalable Approximate smoother (our terminology) which turns out to have 
a discretization process similar to that described in Garcke and Griebel [4]. Topics include the 
existence and uniqueness of the smoother, matrix equations for the smoother, and the pointwise 
convergence of this smoother to the Exact smoother and to its data function. Orders of pointwise 
convergence are derived for the convergence of the Approximate to the Exact smoother which are 
then combined with the Exact smoother error formulas of Chapter 4 to obtain error orders for the 
Approximate smoother. 
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Chapter by chapter in more detail: 



0.1.1 Chapter 1 Weight functions, data spaces and basis functions 

In his paper [2] Duchon studied positive order basis function interpolation. Basically he began by gen- 
eralizing the functional / |£>"u|^ using a positive weight function w and the Fourier transform to 



\(x\=m 



obtain / w ^ \d'^u , and then devoted the paper to studying the special case t« = |-|^**. In [8] Light 

K"* \a\=m 

and Wayne carried on the study of the positive order interpolation problem where Duchon left off. Their 
weight function theory used a strictly positive weight function to directly define the basis functions and 
Hilbert spaces. 

In this document I adapt the work of Light and Wayne to the study the much simpler zero order 
problem. Here the weight function is only assumed to be a.e. positive and the conditions placed on the 
weight functions are expressed in terms of integrals. The weaker condition on the weight function allows 
for basis functions whose Fourier transforms have zeros e.g. the hat or triangular function. Happily, the 
zero order theory presented here only requires simple function space theory, the positive order results 
being considerably more complex - see Williams [13]. 

The theory is illustrated using the standard radial basis functions: the shifted thin-plate splines, the 
Gaussian and the Sobolev splines but I am especially interested in basis functions that are the tensor 
product of the derivatives of hat (triangle) functions, denoted A, which I call the extended B-splines. 
However, as mentioned previously, the Fourier transform of these basis functions has zero values on a set 
of measure zero and the theory has to allow for this. Thus I begin by using a class of a.e. positive and 
continuous weight functions w satisfying (property W2): J^^ '^(^'^•^ < 00 for < A < k, to define the 
zero order data space 



X^ = [fGS': fe LL and V^fG L^} . 



This turns out to be a reproducing kernel Hilbert space of continuous functions with norm ||/||^ q '^^'^ 
inner product (/, 5)^ ~ / I* is shown, for example, that X° C'd'*]) and that 5 fl is dense 
in Xl 

The weight function is then used to define a zero order basis function G by the simple Fourier transform 
formula G = ^. The smoothness obtained is G G C^^'*^'' and several results are proved regarding the 
tensor products of weight functions and basis functions, as well as the products and convolutions of basis 
functions. 

The extended B-splines are discussed in detail. It turns out that the corresponding basis functions are 
the derivatives of convolutions of hat (triangle) functions and hence their name. 



0.1.2 Chapter 2 The minimal norm interpolant 

In this chapter the minimal norm interpolation problem is solved and several pointwise convergence 
results are derived. These are illustrated with numerical results obtained using several extended B-spline 
basis functions and special classes of data functions. In more detail, the functions from the Hilbert data 
space X% are used to define the standard minimal norm interpolation problem with independent data 

X = {a;^*)}^ and dependent data y = {yi}^^^, the latter being obtained by evaluating a data function 

N 

at X. This problem is then shown to have a unique basis function solution of the form ^ a^G (• — a;*-*-*) 

i=l 

with ai G C. The standard matrix equation for the q,; is then derived. 

We will then consider several categories of estimates for the pointwise convergence of the interpolant 
to its data function when the data is confined to a bounded data region. In all these results the order of 
convergence appears as the power of the radius of the largest ball in the data region that can be fitted 
between the X data points. The convergence results of this document can be divided into those which 
use Lagrange interpolation theory, and thus assume unisolvent data X (Definition 66), and those which 
do not. The non-unisolvency proofs are much simpler than the unisolvency-based results and if the data 
functions are chosen appropriately the constants can be calculated. However, the maximum order of 
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convergence obtained is 1, no matter how large the parameter k. On the other hand, the constants for 
the unisolvency-based results are difficult to calculate but the order of convergence is equal to the value 
of max [kJ . 



Non-unisolvent data: Type 1 interpolation error estimates Suppose that the weight function has prop- 



erty W2 for some /t > 0, that the data X is contained in a closed, bounded, infinite data set K, and that 
the basis function G satisfies the estimate 



|G(0) -ReG(a;)| < Cg\x 



2s 



b| < hr 



(1) 



for some ha > 0. Then in Theorem 59 it is shown that the interpolant Ix / of a data function / e X° 

ScLtlsfi-GS 

\f{x)-Ixf{x)\<kG\\fL,oihx,Kr, ^^K, (2) 

when hx,K = maxdist {x,X) < ho and ka = (27r) * \/2Cg- This implies an order of convergence of 
at least s. 

These results are summarized in Table 2.1 (a copy of Table 2.1). 





Interpolant error estimates 




Non-unisolvent Type 1 estimates: k > 0, ka = (27r) \/2Cg- 






Parameter 


Converg. 






Weight function 


constraints 


order s 


Cg 


he 


Sobolev splines 


v--l = l 


i 

2 




oo 


{v > d/2) 




1 


\\D K:^_a/2\\^ 

2-r(„) 




Shifted thin-platc 
{-d/2 <v<Q) 




1 


eq. (2.26) 


oo 


Gaussian 




1 


2e-3/2 


oo 


Extended B-spline 




1 

2 




oo 



TABLE 1. 



Non-unisolvent data: Type 2 interpolation error estimates Here we avoid making any assumptions 



about G and instead assume k> 1 and use the properties of the Riesz representer of the evaluation 
functional / — > / (a;). A consequence of this approach is the estimate (Theorem 65): when hx,K < oo 

1/ ix) - Ixf {x)\ < kG ||/IL,o hx,K, xeK, fexl (3) 

where 

kG = (277)-''/' V- (V^G) (0). 

so the order of convergence is always at least 1. Table 2 (a copy of Table 2.2) summarizes the results 

for the weight function examples: 



Unisolvent data - interpolation error estimates 



Here X is assumed to have a unisolvent subset of 
order max [k\ and is contained in an open, bounded data region SI. In this case it follows from Theorem 
74 that there exist constants /la,^, fcc > such that 

\f{x)-Ixfix)\<kG\\f\L^oihx,a)^''^ , xen,feXl (4) 
when hx.n = sup dist {lo, X) < /io.k- Table 3 (a copy of Table 2.3) summarizes our results: 

Numerical results are only presented for the non-unisolvent data cases. Numerical results 
are presented which illustrate the convergence of the interpolant to its data function. We will only 
be interested in the convergence of the interpolant to it's data function and not in the algorithm's 
performance as an interpolant. Only the extended B-splines will be considered and we will also restrict 
ourselves to one dimension so that the data density parameter hx,n can be easily calculated. 

If we can calculate the data function norm we can calculate the error estimate. All the extended B-spline 
basis weight functions have a power of sin^ x in the denominator and so we have derived special classes 
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Interpolant error estimates 
Non-unisolvent Type 2 estimate: k> 1 


Weight function 


Parameter 
constraints 


Converg. 
order 


(r, \d/4: , ,ri 


Sobolev splines 
[v > d/2) 


z;-|>2 


1 


/r(t)-d/2-l) 

V 2d/2+ir{v) 


l<v-^<2 




1 T{v-d/2-l) 

V 22"-''/2-3r(v) 


Shifted thin-plate 
{-d/2 <v<Q) 




1 


^-2v 


Gaussian 




1 


V2 


Extended B-spline 
(1 < n < 


n>2 


1 


^-Gi {Qf-^D'^G^ (0) 



TABLE 2. 



Interpolant error estimates 
Unisolvent data: k > 1 


Weight function 


Parameter 

constraints 


Convergence 
orders ([kJ ) 


Sobolev splines 
{v > d/2) 


i;- 1 = 2,3,4,... 


[v - d/2\ - 1 


v-i>l,v-^(/Z+ 


iv - d/2\ 


Shifted thin-plate spline 




2,3,4,... 


Gaussian 




2,3,4,... 


Extended B-spline 
(1 < n < 


n>2 


n- 1 



TABLE 3. 



of data functions for which the data function norm can be calculated. The derivation of these special 
classes of data functions led to the characterization of the restriction spaces for various class of 

weight function. For example, Theorem 90 shows that the restrictions of the B-spline data functions are 
a member of the class of Sobolev-like spaces: 

(n) = {ue (n) : D'^ue {n) for ai <m, <i<m} , m= 1,2,3,... (5) 

It is easy to construct functions that are in these spaces. 

As expected interpolant instability is evident and because our error estimates assume an infinite 
precision we filter the error to remove spikes which are a manifestation of the instability. 

0.1.3 Chapter 3 The central difference weight functions 

To understand the theory of interpolants and smoothers presented in this document it is not necessary to 
read this chapter which introduces the central difference weight functions. However, we note that Section 
3.5 contains local data space results specifically designed for tensor product weight functions and that 
central difference weight functions are used as examples in the chapters dealing with smoothers. 

This chapter introduces a large new class of weight functions based on central difference operators. 
The basis functions are calculated and smoothness estimates and upper and lower bounds are derived 
for the weight functions and the basis functions. 

Like the extended B-splines, their data fimctions are characterized locally as Sobolev-like spaces. 

Pointwise convergence results arc derived for the basis function interpolant using results from Chapter 
2. These give orders of convergence identical to those obtained for the extended B-splines in. This chapter 
presents no numerical interpolation experiments. 

This is a large class of weight functions and I have only calculated a couple of basis functions. The 
calculations can be tedious so one really needs to be able to have theoretical results which enable you 
to tell beforehand that the basis function has some especially good properties regarding interpolation - 
I have no results to offer in this regard. 
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The central difference weight functions are defined as follows: Suppose q & <? 7^ 0) 9 (0 — ^ 

and I > n > 1 are integers. The univariate central difference weight function with parameters n, I is 
defined by 

where A21 is central difference operator 

I 

A2;/(0= E (-l)'(fe+;)/(-fcO, / = 1,2,3,..., ^eR^ 

For example, A2/ (^) = — (/ (0 " 2/ (0) + / (—0)- The multivariate central difference weight function 
is defined by tensor product. 

It is shown in Theorem 98 that w belongs to the class of zero order weight functions introduced in 
Chapter 1 for some k iff / Q {£,) ^-^ < fo'" some R > 0. Here k satisfies k + 1/2 < n. The 

l«l>-R 

central difference weight functions are closely related to the extended B-splines defined by 1.20 and a 
discussion of their genesis is given in Subsection 3.2.1. 

Various bounds are derived for w. For example, in Corollary 109 it is shown that if i|j|>^ t^'^q (t) dt < 00 
then for any r > there exist constants c^, c^, fc^, fc^ > such that 



<wiO< 



I c2n 



1^1 > r, 

lei < r. 



^2(l-n) — VS; — ^2(/-n) ' 

By Theorems 111 and 115 the univariate central difference basis function is 



G{s) = i-lf-^^J (d^C-") (*A)') (I) \tf"-'q{t)dt, 



8 e 



with G e C^s"' and bounded in C(2"-i) (M\0). The multivariate basis function is defined 

as a tensor product. For fc < 2n — 2, D'^G is uniformly Lipschitz continuous of order 1 (Theorem 118). 

In Section 3.5 we will characterize the data function space locally as a Sobolev-like space 5. A local- 
ization result specifically for tensor product weight functions is derived in Theorem 123. This supplies 
important information about the data functions and makes it easy to choose data functions for numerical 
experiments. 

Results for the pointwise convergence of the interpolant to its data function on a bounded data domain 

are derived using results from Chapter 1 . These results arc summarized here in Table 4 (a copy of Table 
3.1) and give pointwise orders of convergence identical to those obtained for the extended B-splines. 



Interpolant convergence to data function - central difference weight function. 


Estimate 
name 


Unisolv. 
data? 


Parameter 
constraints 


Converg. 
order 




ha 


Type 1 


no 


q b'nded if n = 1 
else n > 2 


1/2 


^2G, i0f-'\\DG^\\^n 


00 


Type 2 


no 


n > 2 


1 


^-Gi {Qf-^ D^Gi {0)Vd 


00 


Unisolvent 


yes 


n > 2 


n-1 




00 



TABLE 4. 

This chapter does not present the results of any numerical experiments. 



0.1.4 Chapter 4 The Exact smoother 

We call this well-known basis function smoother the Exact smoother because the smoother studied in 
the next chapter approximates it. We will assume the basis function is real-valued. The Exact smoother 
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minimizes the functional 

/'ll/C + ]vEk(^^^^)-2/^| ' (6) 

i=l 

over the data (function) space X° where p > is termed the smoothing coefficient. It is shown that 
this problem, like the interpolation problem, has a unique basis function solution in the space Wg,x- 
The finite dimensionality of the solution allows us to derive matrix equations for the coefficients a, of 
the data- translated basis functions. 

Three estimates are derived for the order of the pointwise error of the Exact smoother w.r.t. its data 
function: two estimates do not assume unisolvent X data and one does. When p = these estimates 
correspond to interpolation results of Subsection 2.5.2 and the Exact smoother convergence orders and 
the constants are the same as those for the interpolation case which are given in the interpolation tables 
1, 2, 3 and 4. 



Non-unisolvent data: Type 1 error estimates When the weight function has property W2 for k > 
it is assumed that the basis function satisfies an inequality of the form 1 and that the data region is a 
closed, bounded, infinite set K. In this case it is shown in Corollary 153 that the Exact smoother Sg of 
the data function / satisfies the error estimate 

\fix)-Seix)\<\\f\\^^o{VpN + kG{hx,Kr), xeK, (7) 

when hx,K = maxdist {x, X) < he and ka = (27r)~'^^^ s/ICg- 



Non-unisolvent data: Type 2 error estimates If it only assumed that k> 1 then it is shown in The- 
orem 156 that 

|/(ar)-SeW| < ||/|L,o(Vp^+fcG/ix,K), x€K, (8) 
when hx,K < oo and kc = (27r)"'^/* ^- (V^G) (0)\/d. 



Unisolvent data error estimates On the other hand, suppose the weight function has property W2 for 
some parameter k > 1, the independent data X contains a unisolvent set of order [kJ and X is contained 
in an open, bounded data region fl. Then using results from the Lagrange theory of interpolation we 
show in Theorem 160 that there exist constants K'^ ^, A:g > such that 

\f{x)-s,ix)\<\\f\\^^o[K'a^^^ + kG{hx,n)^^^), xeU, (9) 

when hx.n = sup dist {x, X) < ho- 

These theoretical results will be illustrated using the weight function examples from the interpolation 
chapter, namely the radial shifted thin-plate splines, Gaussian and Sobolev splines and the tensor product 
extended B-splines. We will also use the central difference weight functions from Chapter 3. 

Numerical results are only presented for the non-unisolvent data Ccises. Numeric experi- 
ments are carried out using the same 1-dimensional B-splines and data functions that were used for the 
interpolants. We restrict ourselves to one dimension so that the data density parameters hx,Q, and hx,K 
can be easily calculated. 



0.1.5 Chapter 5 The Approximate smoother 

We call this smoother the Approximate smoother because it approximates the Exact smoother. This is a 
non-parametric, scalable smoother. Here scalable means the numeric effort to calculate the Approximate 
smoother depends linearly on the number of data points. We assume the basis function is real-valued. 

Two different approaches will be taken to defining the Approximate smoother, and both involve for- 
mulating the smoother as the solution of a variational problem. One of these problems will involve 
minimizing the Exact smoother functional 6 over Wg,x' where X' ~ {x'il^^i is an arbitrary set of dis- 
tinct points in R''. The other, equivalent problem, involves finding the function in Wg,x' which is nearest 
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to Exact smoother Sg w.r.t. the norm ||-||^ q. If 

TV' N' 

Sa {X) = Yl (^) = (2^)"'^' E - ' 

i=l j=l 

denotes the Approximate smoother and y is the dependent data then solving the second problem yields 

Sa = Ix'Se, 

which implies the matrix equation 

{NpRx',x' + Rjc,x'Rx,x') a' = RJcx'V, 



where Rx,x' = [Rx'^ i^^'^))- The size of the Approximate smoother matrix is N' x N' which is 
independent of the number of data points and suggests scalability. 

The error estimates for the pointwise convergence of the Approximate smoother to its data function 
/ e are based on the simple triangle inequality 

1/ (a;) - So (a;)| < |/ (x) - Se {x)\ + \Se (x) - Sa {x)\ , 

and so Section 5.6 will be devoted to estimating \se (x) — Sa {x)\. 

As with the minimal interpolant and the Exact smoother, we will obtain estimates that assume uni- 
solvent data sets as well as the Type 1 and Type 2 estimates that do not involve unisolvency. The 
Approximate smoother convergence orders and the constants are the same as those for the interpolation 
case which are given in the interpolation tables 1, 2, 3 and 4. 



Non-unisolvent data: Type 1 error estimates No a priori assumption is made concerning the weight 



function parameter k but it will be assumed that the basis function satisfies an inequality of the form 1. 
For example, if it is assumed that the data region K is closed, bounded and infinite then Theorem 189 
establishes that 

\Se{x)-Sa{x)\<\\f\\^^okG{hx',Ky, X € K, (10) 

and Theorem 206 shows that 



\f{x)-Sa{x)\<\\f\\^ ,,(^./^ + kG{hx,Kr + kGihx',Ky), X & K, (11) 



when hx,K = maxdist {x, X) < ha and hx',K = maxdist (x, X') < ha- 



Non-unisolvent data: Type 2 error estimates If it only assumed that k>1 then by Theorem 196 



\Se{x)-Sa{x)\<\\f\\^^okG{hx',Kr , X £ R"^ , (12) 

and by Theorem 208 

\f{x)-Sa{x)\<\\f\\^o{Vp^+''Ghx,K + kGhx',K), x€R'^, (13) 

where ka = (27r)"^ ^- (V^G) {0)Vd. 

If X is a unisolvent set of order m > 1 contained in an open, bounded 



Unisolvent data error estimates 



data region fl then by Theorem 200 

|Se (x)-Sa(x) I < 11/11^^0 fee (/iJf'.n)", X G H, (14) 

and by Theorem 211 

\fix)-s^{x)\<\\f\\^,,(^Ki,^^^+kG{hx,Kr + kGihx',Kr), xeU, (15) 
for some constants K'^ ^, fee > 0. We say the orders of convergence are at least m. 
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These theoretical results will be illustrated using the weight function examples from the interpolation 
chapter, namely the radial shifted thin-plate splines, Gaussian and Sobolev splines and the tensor product 
extended B- splines. Wc will also use the eentral difference weight functions from Chapter 3. 

Numerical results are only presented for the non-unisolvent data cases. Numeric experi- 
ments are carried out using the same 1-dimensional B-splines and data functions that were used for the 
intcrpolants. We restrict ourselves to one dimension so that the data density parameters hx,Q. and hx,K 
can be easily calculated. 

We discuss the SmoothOperator software (freeware) package which implements the Approximate 
smoother algorithm. It has a full user manual which describe several tutorials and data experiments. 
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Weight functions, data spaces and basis functions 



1.1 Introduction 

In [8] Light and Wayne developed a weight function theory of positive order basis function interpolation in 
which the weight function directly generated both the basis function and the Hilbert space of continuous 
data functions. We intend to do the same thing here for the zero order case. 

We start by introducing the theory of weight functions using as examples the radial basis functions: 
the shifted thin-plate splines, the Gaussian, the Sobolev splines as well as the tensor product extended 
B-spline weight functions. Results are also proved regarding the tensor products and products of weight 
functions. 

The weight functions are then used to define both the Hilbert data spaces and the basis functions which 
will be used to formulate and solve the variational interpolation and smoothing problems discussed in 
later Chapters. Various continuity results are derived for the basis functions and data space functions. 
The data space is shown to be a reproducing kernel Hilbert space but instead of using the reproducing 
kernel we will use the Riesz representers of the evaluation functional / D"f (x) because the kernel 
is only defined for a = 0. 

1.2 The weight functions 

1.2.1 Motivation for the weight function properties 

We want to modify Light and Wayne's weight function properties and basis function definition so that 
the tensor product hat functions are basis functions of zero order generated by weight functions. We also 
hope to use weight and basis functions that require only simple Fourier transform results and avoid 
subspaccs of the tempered distributions. 

Light and Wayne defined their weight function w to have the three properties: 

A3.1 w is continuous and positive outside the origin. 
A3.2 i; e Ll^. 

A3. 3 For some R> and fi real, < c \x\^ for |x| > R. 

These properties imply ^ & S' and Light and Wayne's definition of a tempered basis distribution 
G G S' of order A; > 1 generated by w (see Section 3 of [8]) imply that as distributions 

1 
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Now, as mentioned above, we want to define a basis function so the multivariate hat function is a basis 
function of order zero. The one dimensional hat function will be denoted by A and is given by 

r 0, N>i, xGR, 

^''> \ l-\x\, \x\ < 1, xgR. ^ ' 

In higher dimensions the hat function is defined as the tensor product 

d 



i=l 

It is well known that 



A(a;) = ]jA(xi), x e M"^. (1.2) 

i=l 

U) = i2ny''' {^^)\ eeM\ (1.3) 
so that in higher dimensions 

A(0 = nA(^,) = (2-)-'^/'n(^)', (1.4) 

i=l i=l ^ ^ 

In one dimension, if A is to be basis function of order zero then we must have A (^) = i.e. 

2 



W 



and w has discontinuities outside the origin, which violates weight function property A3.1. Denoting 
the set of discontinuities by A we have that A is the union of hyperplanes 

d 

A=\J{^:^iG 27rZ\0} . (1.6) 

i=l 

and this is a closed set of measure zero. This will become weight function property Wl in Definition 
1 below. 

Additionally, from equation 1.4 we have that A = ^ E and we want to only use Fourier transform 
results. So we will say that G is a basis function of order if — G and G = —. A well-known 

^ WW 

result then implies G G C^^ C S' and so G is a function. That ^ € is ensured by property W2 in 
Definition 1. Note that our definition of a basis function gives a unique basis function whereas in Light 
and Wayne a basis function of higher order k is unique modulo a polynomial of order < k. 

Finally we will incorporate the parameter k into property W2 to increase the smoothness of the basis 
function and to allow a larger choice of weight functions. The parameter k will be allowed to take 
non-integer values. 



1.2.2 The weight function properties 

Definition 1 Weight functions with real parameter k > 

A weight function's properties are defined with reference to a set A C K'' which is closed and has 
m,easure zero. The introduction of A is motivated directly by the properties of the Fourier transform of 
the hat function. 

In this document a weight function w is a mapping w : M** — > M which has at least the property Wl: 

Wl There exists a closed set A with measure zero such that w is continuous and positive outside A i.e. 
w e G(°) {R'^XA) andw>0 on W^XA. 

It may also have property W2: 



W2 For some real number k > 

-i^L^da; < oo, < A < k. (1.7) 
w[x) 



I 
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Remark 2 1. In Section 1.3 property Wl is used to define the inner product space of distributions 
. Property Wl is needed to ensure that is not empty - see Theorem 9. It is not enough just 
to have a weight function that is positive a. e. 

2. Property Wl allows basis functions to have Fourier transforms with zeros outside the origin e.g. 

the tensor product hat function. 

3. I have chosen the integral form for W2 because it is more general than the form A3. 3 of Light and 
Wayne. 

4. Property W2 allows the definition of a continuous basis function of order zero. 

5. The parameter k of property W2 could be called the smoothness parameter because in Theorem I4 
it is shown that c C^^'^^^ and in Theorem 24 it is shown that the basis functions are in C^'^'^^K 
The allowance of non-integer values of k sometimes permits an extra degree of differentiability to 
be estimated e.g. The smoothness of Sobolev spline basis functions in Subsubsection 1.4.2. 

6. Scaled (or dilated) weight functions are also weight functions with the same parameter k. 

7. Properties such as: J — — ^dx < 00 for < X < k, and bounded for n < v, might be 

w{x) ^ ' 

investigated in order to obtain Sobolev space results. 

The following result gives two equivalent criteria for the weight function property W2. 

Theorem 3 The the following criteria are equivalent to weight function property W2. 

„2/3 |^|«- W 

i- I dx<^, |/3|<L«J. 

|.|2'= 

2. ^ e L]^^ and, J — — < 00 for any R>0. 

\.\>R 

Proof. Part 1 is easily proved using the identity = ^ W.^"^^ ' clearly W2 implies part 2. 

Finally, if part 2 holds for < A < k and any R> 

i2A ^ I |2A I |2A , 11 ,2k 



J w J w J w J w 



|.|<H |.|>ii \-\<n. i\>n. 



<ii 



2\ 



1 /' 1 



W 



^2(«-A) 



|.|<fi |.|>fi 

< 00, 

and so property W2 is satisfied. ■ 

1.2.3 Examples: radial weight functions 
Shifted thin-plate splines 

From equations 25, 26 and 27 of Dyn [3] the shifted thin-plate spline functions 

H{x)^{ L ^ > (1.8) 

{\^\xf) , t;>-d/2, t;^0,l,2,3..., 

have the distribution Fourier transforms 

U (C) = e(y) K,+d/2 (lei) ler'""'' on M^O, v > -d/2, 
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where e{v) > 0, K\{t) = t^Kx{t), t > 0,X > 0, and Kx is called a modified Bessel function or 
MacDonald's function. Here K\ has the properties 

Kx e C(°) (M^) , A > ^; Kx(t)>0; lim Kx (t) = exponentially. (1.9) 

2 t— >(x) 

Also, for a given A > 0, it is well-known that there exist positive constants 1 < bx, < cx < c'x such 
that 

w (0 = , (1-10) 

e{v)K,+,/2m) 

CAe-^^l*l < Kx (t) < 4e-l*l, t e R\ (1.11) 

Now if ^ = ^ then 

w 

and w has property Wl for A = {0}. Further, condition 1.7 holds iff 2A — 2w — d > —d for < A < k 
i.e. iff 

-d/2<v<0. (1.12) 
and so when v satisfies 1.12, w has property W2 for all k > 0. 

The Gaussian 



The Gaussian function 



has Fourier transform 



and so if :r: = -f? 



H (x) = exp (- \xf^ , x&R'^, (1.13) 



^(0 = ^exp(-M!], 



^i;(0 = 4expf^|, (1.14) 



■V/TT \ 4 

and w has property Wl for empty A and property W2 for all k > 0. 
Sobolev splines 

If V satisfies v > d/2 the multivariate Sobolev spline 

H{x)= ^^_^^^^^ K,{\x\), areM^ (1.15) 

has Fourier transform 

(l + iei'j 

and so if — = 

w 

^(0- + CGK', (1-16) 

and w has property Wl for empty A and property W2 for < «; < u — rf/2. 

1.2.4 Products of weight functions 

See Theorem 33 where the product of weight functions is discussed in relation to the convolution of basis 
functions. 
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1.2.5 The tensor product of weight functions 

The next theorem gives some general conditions under which the tensor product of two weight functions 
is a weight function. 

Theorem 4 Suppose wi and W2 are weight functions i.e. they have property Wl. 

1. Then the tensor product Wi (g) W2 also has property Wl. 

2. The tensor product wi (g) W2 has property W2 for parameter k iff both wi and W2 have property W2 
for parameter k. 

Proof. Part 1. In this proof meas denotes the measure of a set. Now suppose Wi : M*^* — > M. Then each 

Wi is associated with a weight function set, call it Ai, which is closed and has measure zero. The first 
step will be to construct the set A for wi (g) W2 from the sets Ai. The candidate is 

^ = (^1 X R'^^) U (M''i X A2) . 

We show A is closed and has measure zero. We use the continuous projection operators pi ^ R''' 

2 

defined by pi {x) = x' and p2 {x) = x" . Next, since A = [J pi^ (Ai) and each Ai is closed, A is also 

closed since Pi is continuous. Finally it must be shown that mcas ^ = 0. But since 

meas^ = meas (^1 x U W^' x A2) < meas (^1 x M''^) + meas {W^^ x ^2) , 

it is sufficient to show that meas {Ai x M'^=) = 0. To do this we use the countable additivity property 
of the Lebesgue measure on R'^. In fact 

C30 

meas (^Ai x R"^^) = meas y (Ai X (^5(0; n) - S(0;n- 1))) 

n=l 

00 

= meas (Ai x (b (0; n) - B{0;n- 1))) 

n=l 

= 0. 



Part 2. Suppose wi and W2 have property W2 for parameter k. We show wi (8) W2 also has property 
W2 for parameter k. Recall that property W2 requires that 

-dx < 00, < A < k; « = 1, 2. 



Now 

and the inequality 
allows us to write 



[M 

J Wi {x) 

/ l^dx = / ^— -i-dx, (1.17) 

J W{X) J Wl {X') W2 (X") 

l^r^ < 2^ (|a;f V |x'f ^) , (1.18) 



|2A . 2M kf ^ + b'f ^ 



-dx < I — —- ——^dx 



W {x) J Wl {x') W2 {x") 



\J WI{X')W2{X") J \J WI{X')W2{X") J 

_nx( f Wf^dx f dx \ ^( f dx' f \x"f^dx" ' 

\J Wl {X') J W2 {X") )^ \J Wl {X') J W2{X") ^ 



< 00. 
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Now suppose wi (8) W2 has property W2 for parameter k. Thus, by 1.17 

A 



/ 



(k'i' + k'f)' 

^ ^ dx<oo, 0<X<K, 



Wi {x') W2 {x") 
I /j2A |^"|^^ 

so that / — Tin < °° / T~ir\ rT\'^^ < 

Ix'l"^^ \x"\'^^ 
Hence f r-da;' < oo and f r——dx" < oo i.e. wi and W2 both have property W2 for parameter 

Wi [X') W2 (X") 

K. ■ 

The next resuh generahzes the previous theorem to an arbitrary number of weight functions. 
Corollary 5 Suppose the functions {wi^^^-^ satisfy property WI of a weight function. 

n 

1. Then has property WI. 

n 

2. Wi has property W2 for parameter k iff each Wi has property W2 for parameter k. 

i=l 

Proof. An easy proof by induction using Theorem 4. ■ 

1.2.6 The hat weight function and the weight function properties 

The next theorem shows under what conditions the multivariate hat function is a basis function of order 

zero. 

Theorem 6 Using the multivariate hat function A define the function wa by 

Then wa satisfies the weight function properties WI and W2 for the parameter k in the following 
manner: 

1. Wa satisfies property WI for all k. 

2. Wa satisfies property W2 for parameter n iff k < 1/2. 

Proof. Part 1 By 1.6, wa is the set A which is closed with measure zero. 



Part 2 Part 2 of Corollary 5 tells us that we need only establish property W2 in one-dimension. Now 
W2 holds for k iff 

= / x^^ ^—^dx < oo, < A < AC. 



wa{x) J {x/2f 
But these integrals will exist iff they exist near infinity iff 2 — 2A > 1 i.e. iff A < 1/2. ■ 

1.2.7 The extended B-spline weight functions 

We now introduce a new class of weight functions which satisfy property W2. We call this class the 
extended B-spline weight functions because in Subsection 1.4.4 it will be shown that they generate basis 
functions which are the derivatives of the B-splines. 

Theorem 7 The extended B-spline weight functions Suppose k > is an integer. For given 
integers l,n>0 define the extended B-spline weight function w by 

d 2n 

"'(^) = TT-^^' x={xi,...,Xd)eR'^. (1.20) 
f-^^ sm^' Xi 
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Define the closed set of measure zero A to be the union of hyperplanes 

d 

A= y \J{x:Xi = -na} . 

Then the function w is a weight function with property Wl w.r.t. A. Also, the weight function w has 
property W2 for k iff n and I satisfy 

K + 1/2 <n<l. (1.21) 

Proof. Clearly property Wl is satisfied for ^ so w is a weight function. Corollary 5 tells us that we need 
only establish the criterion 1.21 in one dimension. Now W2 holds for k iff 

r ^ I ^2X^2^^^ 0<X<K. (1.22) 

But these integrals will exist if and only if they exist near the origin and near infinity. They will exist 
near the origin iff 2A + 2/ — 2n > —1 i.e. iff A > n — l — 1/2, and they will exist near infinity iff 2A — 2n < — 1 
i.e. iff A < n- 1/2. Thus the integrals all exist iff n- / - 1/2 < A < n- 1/2 for < A < k iff k < n- 1/2 
and n < / + 1/2 iflf /t + 1/2 < n < Z. ■ 
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We now introduce the zero order data spaces which will later be used to define the minimum norm 
interpolant. Here w denotes the weight function. In Chapters 2, 4 and 5, will be used to define 
several variational interpolation and smoothing problems. In a manner analogous to the Sobolcv spaces, 
mappings between X° and are used to show that X° is a Hilbert space. Various smoothness and C°° 
density results are then established. 



Definition 8 The zero order inner product space X^ 

Suppose w is a weight function i.e. it only has property Wl of Definition 1. Then define 



(1.23) 



and endow it with the norm and inner product 

. 2 







j w 


f 



wfg. 



= 



That \\-\\^ Q is a norm is simple to prove. Suppose ||/||^ o ~ ^- '^hen, since w > a.e., J w 
implies that f (x) =0 a.e. and thus f = in the distribution sense. 

The next result reassures us that the space X^ is non-empty. 
Theorem 9 If w is a weight function w.r.t. the closed set A of measure zero then 

^u-.uGCg' and suppucW^XA^ <Z Xl, (1.24) 

where the set on the left is non-empty. 

Proof. Firstly, ]R''\^ is not empty since ^ = M'' implies meas^ ^ 0. Clearly 

/ e |w : w e and suppu C ]R''\yl| implies / e 5 C 5' and / e C Lj^^. Also, since / has 
bounded support in and w is positive and continuous on 



V^f 



< oo. 



18 



1. Weight functions, data spaces and basis functions 



1.3.1 The completeness of 

In a manner analogous to Sobolev space theory, the completeness of X° will be established by construct- 
ing an isometric homeomorphism I : — > and then making use of the completeness of L^. 

Definition 10 The linear mappings I and J 

Suppose w is a weight function. Using the definition of the space X° we can define the linear mapping 



If. in addition, we assume property W2 then & C S' when g G L"^, and we can then define the 
linear mapping J : LP' S' by 



The linear mappings X and J' have the following properties: 
Theorem 11 Suppose w has property Wl of a weight function. Then: 

1. I : — > is an isometry. 

2. I is one-to-one. 

Now suppose that in addition to property Wl, w has property W2. Then: 

3. J : L'^ ^ X^ and is an isometry. 
4- J is one-to-one. 

5. J oX = I on X° , and I o J = I on L"^ . 

6. The mapping T : X^ — > is an isometric homeomorphism with inverse JT". 

7. The mappings I and J are adjoints. 

Proof. The proofs are straight forward and will be omitted. ■ 

The fact that is complete, and hence is a Hilbcrt space, is a simple consequence of the last theorem: 

Corollary 12 Suppose w is a weight function with property W2. Then X^, is complete and hence a 
Hubert space. 

Proof. By part 6 of the previous Theorem 11,1: X° — > is an isometric homeomorphism. Hence X° 
is complete since is a Hilbert space. ■ 

1.3.2 The smoothness of functions in 

We begin with a lemma of basic Fourier transform results. 

Lemma 13 



1. (Theorem 4.2 of Malliavin [10]) If f € S' and f e L\ then f € C'^g' , \f {x)\ as \x\ oo, and 



2. (Corollary 2.12 of Petersen [11]) If f € and for some integer n > 1, |-|"/ € L^, then f € C] 
Further, x^f G when < n and 




by 



If={V^f) , f€Xl 





V 




(1.25) 



The next theorem presents our first smoothness result for functions in X[ 
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Theorem 14 Suppose thatw is a weight function with property W2 for parameter k. Then C Cg'^^^ 
Proof. Since w has property W2, by Theorem 3, if \f3\ < k 



r\ |2|/3| . I |2|/3| ^ I |2|/3| ^ ^ ^ 

/ ^-^ = / + / < / - + / — < OO. 

J w J w J w J w J w 

M<i M>i M<i M>i 



i2k 



(1.26) 



Hence when / e X° the Cauchy-Schwartz inequahty yields 



W 



r,0 < 



i.e. / G X° implies D^/ G when < n and hence / G Cg by part 1 of Lemma 13. Further, 
again by part 1 of Lemma 13 



\DPf{x)\={2^) 



-d/2 



j e"f^/(0 



< (27r) 



-d/2 



The next theorem derives some inverse Fourier transform formulas and inequalities for derivatives of 
functions in X^. 

Theorem 15 Suppose that w is a weight function with property W2 for parameter k. Then for f G X^ 
we have the inverse Fourier transform formulas 



< K. 



D^f {x) = (277)-'^/' j e'-^DPf (0 d^, 
and the derivatives satisfy 

\D^f{x)\ < (27r)-'^/2 f y Lj ||/||^_^, \^\ < 
so that X^ ^ C^'-'*-' ^ when ^ is endowed with the usual supremum norm. 



(1.27) 



(1.28) 



Proof. From the proof of the previous theorem we have: / S X^ implies Dl^f E when \/3\ < k. The 
inverse Fourier transform formulas 1.27 are now simple consequences of part 1 of Lemma 13. 
The bounds on the derivatives are derived from 1.27 using the Cauchy-Schwartz inequality: 



D^fix)] < (27r)-^ J |7(5)| d^ < {2n)-i J ^-^V^\f\ < (Stt)"* ( J 



|2|/3|\ 3 



Wwfi ■ 



Remark 16 Inequality 1.28 for (3 = implies that X^ is a reproducing kernel Hilbert space and in 
Section 1.5 we will calculate the Riesz representers of the evaluation functionals f — > D"/ (a;). We have 
thus established an important link between pointwise processes and Hilbert space theory. 



1.3.3 Examples 
Shifted thin-platc splines 

From Examples 1.2.3, w has property W2 for all k > 0, and thus Theorem 15 implies X° c Cg' as sets. 
The Gaussian 

It was shown in Examples 1.2.3 that w has property W2 for all k > 0, and so Theorem 14 implies 
Xl c as sets. 

Also, by expanding the Gaussian about the origin using a Taylor series we can show that as sets 

oo 

X%(ZH°° = n -H'"' where if" is the Sobolev space of order n (Definition 80). 

n=0 
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Sobolcv splines 

It was shown in Examples 1.2.3 that property W2 holds for < k < v — d/2. Now by Remark 16, 
X° ^ ^(max[Kj) ^j^^^ max Ik\ — [v — d/2\ when v — d/2 is not an integer and max [kJ = 

[v — d/2 —l\ = \y — d/2\ — 1 when v — d/2 is an integer. 

Also, from 1.16, w {£) = (l + ICI^)" and so 1.23 implies X% = iJ" where if" is the Sobolev space of 
positive order v defined using the Fourier transform. 

Extended B-spline weight functions 

These splines are defined in Theorem 7. Since property W2 holds for all < k < n — 1/2 it follows that 
max [kJ = n — 1, and thus X° C^~^' by Remark 16. 

In Lemma 89 we will show that X° ^ iJ". Here i?" is the 'Sobolev-like' space introduced in Definition 
81 below and it consists of all L'^ functions such that D"f e L'^ when each aj < n. 

1.3.4 Some dense C°° subspaces of 

The results of this subsection will not be used later in this document. 
Results to handle the weight function discontinuity set 

In this section we prove some results needed when we want to use a partition of unity to handle the 
discontinuity or zero values of a weight function on a set of measure zero. Recall that weight function 
property Wl introduced in Definition 1 required that a weight function be continuous and positive 
outside a closed set A of measure zero. Light and Wayne assumed their basis functions were continuous 
and positive outside the origin i.e. A = {0}. In order to allow hat functions to be basis functions we have 
had to allow the weight set to a closed, unbounded set of measure zero. 

The discontinuity of the weight function needs to be taken into account when we prove the density of 
some C°° subspaces in and , and when we prove the smoothness of basis functions generated by 
weight functions which have property W2. 

In this document we will use the next lemma whenever we want to construct a partition of unity using 
points 'near' a weight function set. This lemma will be used with J-' — A, where A is the weight function 
set. Lemma 18 will be applied to a function of the 'near' points. Nearness can be measured using the 
concept of a neighborhood of a set. In fact, if ^ c M** and 77 > then define J^ri = [J B {x; rf). The set 

J^ri is referred to as the r]- neighborhood of T. 

Lemma 17 Let T he a set of points in W^. Then for any r] > there exists a function f^ such that: 

1- frj G C^, 

2. 0<fr, (X) < 1, 

3. frj (x) = 1 when x £ J^ri! 
4- fn {x) = when x ^ ^3^. 

Proof. Choose r] > 0. There exists a mollifier w e Cq' satisfying 



suppw C B (0; 1) ; J LO = 1; w > 0. 



Now define {x) = rj "^co [x/rj) and let X2r] be the characteristic function of the set ^277- Then it can 
be shown that the function 

fv i^) = j X2r) (y) u)r, {x - y) dy , 

has the required properties. ■ 

The next theorem will require the following standard measure theory results which are stated without 
proof. 
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Lemma 18 1. Suppose the set A is closed and has measure zero. If Ae is the e— neighborhood of the 
set A then for any open ball B, meas {B (lAe) ^ as e — > 0. 



2. Suppose the set A is closed and has measure zero. 
Then f G implies, limj^ \f\ =0. 

Some spaces of continuous functions which are dense in 



The next theorem is an extension of Theorem 2.5 of Light and Wayne [8]. The difference is that instead 
of the set {0}, we are deaUng with a set A which is closed and has measure zero. There is no need to 
assume that e ^. To handle this set we use Lemma 17 and Lemma 18. 

Theorem 19 Suppose w is a weight function with properties Wl and W2 with respect to weight set A. 
Then the set | ^/wf : f e C^°^ | fl is dense in L? . 

Here Cg"^ denotes the continuous functions with compact support. 
Proof. In this proof we will use the abbreviation 

By Lemma 17, given /i > 0, there exists a function iph G C°° satisfying < 'i/l'?^ < 1, i/^?^ = 1 on Ah and 
ij^h = outside Aah- Now define the mapping '■ Cq^^ — * C'o'^^ by 

^>h9 = il-^Jh)g, geC^"\h>0. 

Since w is positive and continuous on M'^yyl, we have '■^^''^ € C^°^ and hence (1 — tph) 9 € y/wCo^^ DL"^ . 

In other words, ■ C^°^ V^Cf^ H L^. 

Now Cq"' is dense in L^, and given go e L"^ and e > we can choose g^ G Cq"^ such that \\go — 6'e||i2 < 
e/2. Then 



\\90 - ^h9e\\L^ < ho - 9s\\l2 + - */»5e|li2 < e/2 + ||V/»5e|L2 <£/2+i J 



I |2 
We 



Finally, since Ig^f G L^, it follows from Lemma 18 that lirn J^^^ Ig^f = 0. Thus we can choose h> 

so that llfifo - ^hOeWL^ <£. m 

The next theorem improves the previous theorem. This is Light and Wayne's Proposition 2.7 and our 
proof is a variation of Light and Wayne's proof. 

Theorem 20 Suppose w has extended weight function properties Wl and W2 with respect to weight set 
A. Then the set {\/wf : f e Cg"} fl is dense in L'^. 

Proof. In this proof we will use the abbreviations, 
V^C^ n = {^f : / e c§°} n and y/iBC^°^ nL'^ = {^/Sy/ : / e C^°^} n L2. 

Now ^/wC^ nL^ d VwCll^^ n L^. Hence, by Theorem 19, if it can be shown that ^/wC^ n is 
dense in y^Cq"' n L^, this theorem is true. First note that C^' is dense in and hence dense in Cq^\ 
Next select e > and define a mapping : Cq°^ — * Cq° where Qef is an element of such that 

Il/-e,/||^. <e. 

We want to use 0e to construct a mapping (see left side of 1.29) from s/wC^^ fl to -JwC^ n L^, 
which can be used to prove that ^/wC^C\LF' is dense in ^/wC'^'' r\Lp' . If / G ^/wC'^^ HLF' , then ^ e Cq"^ 

and ^/wQs (:^) ^ ^/wC^ , 5 > {). But we still need y/wQs (:^) ^ do this we use the function 

ijjh defined in the previous Theorem 19. Since (1 — V'ft.) ^/w G C^^^ we have 

(1 - il)h) V^Qs [4=] ^ \/^C'o° n i^ when f G s/^c'^^ n L^. 
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Finally, it will turn out that the support of (1 — tph) y/ui'ds (^"^ j needs to be restricted by multiplying 
it by a function (pR e , R > 1, such that < (pR < 1, supp0_R C [—R— 1] and (pn = 1 on 

[-R,R]. 

We now assert that there exist R, h and S such that 

< e. To prove this assertion write 



(i-Vh)</>fiv^e,(^)-/ 



L2 



(1 - Iph) (PrVw&S f ^ ) - / = (1 - V'/i) (PrVw&S ( ^ ) - (1 - ^h) (PrVw \^J= 



+ {l-i'h)^RV^(^ ]-M + <t>Rf-f 



= (1 - <i>RVw [ Qs [ --j=j - ] + Tphfl'Rf 



!)/• 



Observe that (1 — tph) \/w G C^'^^ implies (1 — iph) (pR^/w G Cq^\ Hence 

(1 - cp.V^es (^) - /[^ < II (1 - ^nV^ (e^ (;^) - ^ 



+ 



L2 



+ Uhfh^ + mR-l)fh^ 

f 



< (1 - tph)(i)RVw\\ 



I 



i2 



+ \\M\L^ + \\{<PR-l)f\\L^ 

< 11(1 - ^h) ^rML ^ + WMl- + \Ur - 1) ■ 

We will now consider each term on the right side of the last inequality separately. Regarding the last 

1/2 



term. 



= (/|.|>i^l/l') 



and so R can be fixed so that 

1/2 



1),/|Il2 < e/3. Next, 



by definition of V'/i, IIV'^i/IIl^ ~ {Ja^k I'^I j a.nd, since |/| e L^, it follows from Lemma 18 that 
lim l/p = 0. A value of h can now be chosen so that ||V'/i/llz,2 < e/3. Now we have 



< ||(l-V/.)</'fiV^|L (5 + 2£/3, 



(1.29) 



forall/e\/^IJC^°^ni2. 

The last step is to choose 5 so that ||(1 — iph) ^flV^lloo ^ — ^/"^' ^'I'l theorem follows. 



1.3.5 Some dense and S subspaces of 

The next result corresponds to Light and Wayne's Corollary 2.8. Here X° is Light and Wayne's space 
Y and Cg° denotes the space of Fourier transforms of all functions in Cq' . 

Corollary 21 Suppose w has properties Wl and W2 i.e. w € C(°) {R'^\A) andw>0 on R'^\A, where 
A is a closed set of measure zero. 

Then the spaces X% n C°° and X ° n 5 are dense X° . 

Proof. Let I : be as given in Definition 10 i.e. If = sjwf . Then 

I [xl r\Cf)^ {v^/ : / e and / e = {v^/ : / G n i^. 

Thus, by parts 1 and 2 of Theorem 11, X is an isometric isomorphism from n to 
jv^/ • / G '^0°! 1^ Since we know by Theorem 20 that the last set is dense in L^, we have the 
density of n Qf in X^. 

Clearly, since n C n 5, n S' is dense in . ■ 
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1.4 Basis functions 

In this section wc will define a (unique) basis function of order zero generated by a weight function with 
property W2, and note that this contrasts with the positive order case where the basis function is only 
unique modulo a space of polynomials. Basis functions are generally denoted by G and will always be 
members of X^. If a weight function has property W2 for parameter k it will be shown that G G cjj^^"^\ 
whereas in general we know that X° c C^^"^-*. Basis functions are used to construct solutions to the 
variational interpolation problem of part 2 of this document and to the two smoothing problems studied 
subsequently in Chapters 4 and 5. 

We will also derive results concerning the basis functions generated by the tensor (or direct) product 
of weight functions with reference to the example of the extended B-sphne basis functions introduced in 
Subsection 1.2.7. 

Definition 22 Basis function 

Suppose a wej(]ht function w has property W2. Then — G and hence by Lemma 13, {^Y e cf. 
We now define the unique basis function G of order generated by w to be 

G=(i)\ (1^30) 



A simple consequence is that G{x) = G {—x). 

The next theorem makes three important points: (1) unlike in the positive order case basis functions 

are unique and are data fimctions i.e. they are members of X^,, (2) scaling can be used to enlarge the 
classes of weight functions available and (3) in the case of the extended B-splines scaling can dramatically 
improve the performance of the interpolant. 

Theorem 23 Suppose w has property W2 for k. Then: 

1. The basis function G generated by the weight function w is a member of X^. 

2. If X> then the weight function X'^w (j) has property W2 for niffw has property W2 for k, and 
G{Xx) is the basis function generated by X'^w [j). 

Proof. Since C^°^ C S' it follows that GeS',Ge L}^^ and V^G = ^ e i.e. G & X^. 

Further, G{Xt) = jjG [j) = ^ so the corresponding weight function is X'^w [j). But by part 6 
of Remark 2 this is a weight function with the same parameter k as w. m 

1.4.1 The smoothness of basis functions 

Not only is the basis function a data function but all basis function derivatives up to order k are data 
functions: 

Theorem 24 Suppose the weight function w has property W2 for parameter k. Then the basis function 
G generated by w is in C^'^'^^ ' (W^) . Additionally, the inverse Fourier transform formulas 

D^G (x) = J e'^^DPG (C) < 2k, (1.31) 

hold and D^G € X° when < k. 

Proof. Since property W2 implies ^ & L and -^-j^p G L , an application of the formula of part 2 of 

1 ( 1 2^ I ) 

Lemma 13 with f — — proves G G OA . Formula 1.31 then follows from equation 1.25 of Lemma 13. 



Now to prove I?^G G X^^,. Firstly, G G S" implies D'^G G S' . Next, for compact K, Jj^ 
Ik 4!r" ^ since ^ G L^. Finally, for < k, 1.26 implies 

^2/3 f |.|2|/3| 



DI^G 



< 



since w has property W2. ■ 
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Remark 25 

1. If the set of values of 2k has a finite upper bound then although max 2k may not exist e.g. the 
extended B-splines example below, max [2«;J always exists and G G f7^a,x[2K;j) 

2. Allowing k to take non-integer values in the definition of weight property W2 (Definition 1) can 
yield an extra order of smoothness e.g. the Sobolev spline example below. 

1.4-2 Examples: radial basis functions 

The radial functions used in Examples 1.2.3 to generate weight functions (generally denoted by G), 
namely the shifted thin-plate splines, Gaussian and Sobolev splines (univariate Laplacian kernels) can be 
used as radial basis functions. The basis functions associated with the extended B-spline weight functions 
will be derived below in Subsection 1.4.4. 

Shifted thin-plate splines 

The basis functions are given by 1.8 where —d/2 <v <Q i.e. 

0(0;) = (-1)^"! (l + |a;|^y , -d/2<v<Q. (1.33) 

The weight function has property W2 for all k > 0. Hence by Theorem 24, the basis function lies in 
(-^U.2kJ) for k > and so is a function. 

Tlio Gaussian 

The weight function 1.14 has property W2 for k > and the basis function is given by 1.13 i.e. G {x) = 
e-l^l^ Hence by Theorem 24, we have G e C^^^"-'^ for k > and so G e C^. In fact G e S. 

Sobolev splines 

The basis function is 1.15 i.e. 

g(^) = 2^-ir(^) ^.-d/2(N), xGR", 

where V > d/2. It was shown in Examples 1.2.3 that property W2 holds for all < k < f — d/2. 
Now by Remark 25, G G (7^^''L2kJ) ^^^^ max [2kJ = [2u — d\ when 2v is not an integer and 

max [2k\ = [2w — dj — 1 when 2v is an integer. This is an example where allowing k to take non-integer 
values in the definition of weight property W2 (Definition 1) yields an extra order of smoothness. An 
example is when v — d/2 = 1.6. 

1.4.3 Basis functions generated by tensor product weight functions 

Here we prove that the basis function of a tensor product weight function is the tensor product of basis 
functions. 

Theorem 26 Suppose wi andw2 are weight functions which satisfy property W2 for parameter k. Then 
part 2 of Theorem 4 implies that w = wi®W2 is a tensor product weight function satisfying property W2 
for parameter k. 

We prove here that G = Gi (g) G2 where Gi , G2 and G are the basis functions of order generated by 

Wi , W2 and w respectively. 

Proof. Here each Gi € Cg ^ and so each Gi G S' and we are interested in the tensor product of members 
of S". In this proof we refer to results from Vladimirov [12] where the term direct products is used instead 
of tensor products and S' is referred to as the space of generalized functions of slow growth. In Subsection 
2.8.5 of Vladimirov it is shown that Gi ® G2 S S' and in part (e) of Subsection 2.9.3 it is shown that 
G = Gi®G2. Thus 

G = Gx®G2 = — ® — = = — . 

Wi W2 Wi ^W2 W 
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Corollary 27 Suppose that {wi} is a set of weight functions which satisfy W2 for parameter k. Then by 
part 2 of Theorem 4 w = 0Wi is a tensor product weight function satisfying property W2 for parameter 

K. 

Further suppose that {Gi} is the set of basis functions of order generated by the Wi. 
We claim here that G = ®Gi is a basis function of order zero generated by w. 

Proof. By induction using Theorem 26. ■ 

One of the main motivations of this document is to define the weight functions so that tensor product 
hat functions are basis functions. The following corollary shows that the hat function is a basis function 
in any dimension. 

Corollary 28 Suppose A is the d— dimensional tensor product hat function. Then A is the basis function 
of order zero generated by the weight function i . 



1.4-4 The extended B-spUne basis functions 



In this subsection we will derive the basis functions of zero order generated by the class of tensor 
product weight functions studied in Subsection 1.2.7, namely the extended B-spline weight functions. 
The extended B-splinc weight functions were so named because their basis functions will turn out to 
be the derivatives of the convolutions of hat functions, denoted (*A)' , i = 1, 2, 3, . . . and these are the 
B-splines. The next theorem gives the unique tensor product basis function of zero order generated by 
these weight functions. 

Theorem 29 The extended B-spline basis functions Suppose w is the extended B-spline weight 

function considered in Theorem 7 and suppose w has property W2 for n i.e. k + 1/2 < n < I. 

d 

Then the basis function G of order zero generated by w is the tensor product G{x) = Y\ G\ {xk) where 

fe=i 



Gi {t) = (-1)'-" J^D^a-") ((*A)') (I) , i e MS 

and (*A)' denotes the convolution of I 1-dimensional hat functions. Further, if n < I we have 

\l — n l—n 



(1.34) 



(-1)' 



(27r) 



il-n)/2 



(*A)"-^ * 



E (-l)'(fin+1)A(--fc)> 

fe=-((-n) 



1 is a bounded, piecewise constant function with bounded support, and 
D^"Gi is the finite sum of translated delta functions. 

Finally, G € Gg^'^ (K'^), the {-D"G}|^|^2n-i '^'^^ bounded functions with bounded support, and 
the{D°'G}^^^^2n ^'''^ tensor product of bounded functions with bounded support, and finite sums of 
translated delta functions. 

Proof. In this proof it will be better to use the operator notation F [f] for the Fourier transform instead 
of/. 

Define wi (t) = J^2i ^ , i € K. Since the 1-dimensional hat function A satisfies 

2 



F[A]{t) = {2ny 



-1/2 



sm 



t/2 J 



t e 



we have 



Wl{t) 



sj^ = ) (2iii)2/ ^ (27r)'/^i2('-") iF[A] i2t)f 
= (27r)'/^i2('-")F [(*A)'j (2t) 
= {27, f (i)'^'""' [(*A)']) {2t) 

= {27rf^ (i)'^'""^ F (*A)'] {2t) 

= (-l/-"i5S^[^'^'-"^(*Ayl {2t), 
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Now since the parameters n and I used to define the weight function w are independent of the dimension 
d, wi satisfies properties Wl to W2 and so 1/wi G L^. Also 



Gi (t) = 



1 



{t) = (-1)'-" g^^F-i [f (*A)'] (2t)] {t) 



We will require the convolution identities 

^(•-a)*/ = (27r)-^/V(--a), f&V, 

and 



fe=— m 



Now suppose that n < I. Then in one dimension 

(*A)' = (*A)" * (*D2a)'-" = (*A)" * (* ((5 (• - 1) - 2(5 + 5 (• + 1)))'"" 



V / k——{l—n) 
l—n 

(*Ar^ ^ (_i)'=(^2(.-;))A(._fc). 

/c=-((-n) 



(27r) 



l — n 



(l-n)/2 



Further 



-l)'-"^(i)'"-W-")(*A)') Q 

-1)'-" ^ (i^^'- (*Ay) (0 

-i)'-"^((*i?^a)'-%a)(0 

-1)'-" ^ ((* {S{--l)-26 + 5{- + l))f-' * a) Q 



/-I /-I 



(27r) 



-(i-i) 



(27r) = fc=_(i_i) / 



fe=-(i-i) 

which is a continuous, piecewise linear function with bounded support. The stated properties of Gi 
and G now follow directly. ■ 

1.4-5 Convolutions and products of weight and basis functions 

The results of this subsection will not be used later in this document. In this subsection we give several 
results concerning the product and convolution of basis functions. The next result gives conditions under 
which the product of two basis functions of order zero is a basis function of order zero. 
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Theorem 30 Suppose the weight functions wi and W2 have property W2 for parameter Ki. Suppose Gi 
and G2 are the basis functions of order zero generated by the weight functions wi and W2 ■ Suppose that 
1/wi is bounded a.e. or G1G2 € ■ 

Then the product G1G2 is a basis function of order zero generated by a weight function w which satisfies 
property W2 for parameter k = min {ki, K2}. 

Further, w e C(°) and w{x) >0 for all x i.e. G^G^ (0 > for all ^. 

Proof. We first note that from Theorem 2.2 and Exercise 2.4 of Petersen [11], that if f,g G then 

f * g E L^, f * g = fg and, if / is bounded a.e., / * p is a continuous, bounded function. 

Now define the functions G and w hy — = — * — e and G = —. We now have f — ) = 

{^y {^y G ^ G,G2. 

If G1G2 e then ^ e cL°\ and if ^ is bounded a.e. then ^ * J- = i g c(°). Thus |w > c> 
for some constant c, and is continuous whenever w is finite. But wi > and W2>Q a.e. so w{x)>c for 
all x. 

The next step is to show that w{x) < 00 for all x. By definition of wi and W2 there exist closed 
sets of measure zero, Ai and A2 such that Wi is continuous and positive outside Ai. For each x G R"*, 
let Bx = {x — Ai) U A2 and note that Bx is closed with measure zero, and that, as a function of y, 
Wl {x — y) W2 {y) is continuous and positive on R'^\Bx- Thus, we have for all x 



1 

T372 



> 0, (1.35) 



w (x) (2^^) Jmd\i3^ Wl (x - y) W2 (y) 
and consequently that w (x) < 00 for all x i.e. w G (7^°^ (M). 

It remains to be shown that w satisfies property W2 for k = min {ki,K2}. Suppose < A < k. Then, 
for some constant c\ > 



I 



\x\^^ dx ^ f f \x — yf'^dydx f f \y\^^d.ydx 

w{x) J J Wl {x - y) W2 [y) '^^ J J wi{x- y) W2 {y) 

f \x — yf^ dx f dy f \y\^^ dy f dx 

= CA / 7 r / TT + CA 



Wl {x-y) J W2 {y) ' J W2 (y) J wi{x- y) 

2A „ , /. I |2A 



J Wl J W2 J W2 J Wl 



< 00. 



The last theorem can be expressed as the following weight function result. 

Corollary 31 Suppose Wi and W2 are two weight functions which satisfy W2 for k = Ki and k = K2 
respectively. Further, suppose that 1/wi is bounded. Define the function w by ^ = * 

Then w is a weight function which has property W2 for k = min{Ki, K2}. 

Further, w G C(°) (M) and w{x) >0 for all x. 

Remark 32 Any basis function can be uniformly pointwise approximated by a sequence of basis functions 
which have Fourier transforms that are always positive. 

In fact, suppose G is a basis function for which G {^) Q for all ^. Define the sequence of functions 
Gn iO = ^ (C/"^) G (^) , n = 1,2,3,.. .. Clearly, by Theorem 6 and the previous theorem, Gn is a basis 
function for K3 < min {1/2, K2} and Gn (0 > /o*" "^^^ ^• 

Further, Gn ^ G uniformly pointwise. This is true because the definition of A implies 



\Gn{0-G{0\<{ 



n 

|G(OI, 1^1 >n, 



and the definition of G implies lim G {£, ) = 0. 

A specific example is G = A for which ||G„ — G\\^ < 1/n. 
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The next result gives conditions under which the convolution of two basis functions is a basis function. 
This is equivalent to a result showing that the product of two weight functions is a weight function. 

Theorem 33 Suppose G\ and G2 are the basis functions of order zero generated by the weight functions 
Wl and W2 respectively, and that each Wi has property W2 for parameter Ki. Then the following two 
results hold. 

1. If Gi e L^, the convolution 

Gi*G2 = j Gi{x- y)G2 {y)dy, 

is the basis function generated by the weight function w = W1W2, and w satisfies property W2 for 

K = K2- 

2. If we further assume that for an integer Ui < Ki, D'^°'Gi G when \a\ = ni, then w satisfies 
property W2 for parameter K = n\ + K2 and 

D''+' (Gi * G2) = y D^Gi {x - y) D'G2 {y) dy, \^\ < m, \5\ < K2. (1.36) 

Proof. Part 1. Define the function w = w\W2- Clearly w is a weight function since if A\ and A2 are 
the weight sets, w{x) > Q and w G C^°^ outside the set ^1 L) A2, which is closed and has measure zero. 
Since Gi = — € C^^ we have — - — e and are able to define the function G = — - — . We now show 

^ Wl D W1W2 101102 

that G = Gi * G2- First observe that because G\ e and G2 S the convolution integral exists. 
Now, by definition of G2 



j Gi{x- y) G2 {y) dy = -j^^ J j Gi{x-y) ^^-j^d^dy, 
and, since that this integral is absolutely convergent, we can change the order of integration so that 
y Gi (x - y) G2 (y) dy = J J e^y^G^ {x - y) dyd^ 

The final step applies Corollary 3.7 of Petersen [11]. This states that if f G and f G then 
/ (x) = / e'^^/ (^) d(, a.e. In our case we choose / (a;) = and obtain 

/f e^'^^df 
Gi{x-y)G2{y)dy = ^ j ^^^-^^^^=G{x). 

Regarding property W2, if < A < K2 then, / < J_ ^ J H_ < 00. 

Part 2. When \a\ = m, we have = (-l)'"' ^^""Gi = (-1)"' ^ and e G^°'. Thus 

/!£= f\-l""\-f", j2 1 f^iifl,^,^^ 

J w J W1W2 a\ J Wl (^) W2 (0 

|a|=ni 

and since ^ G L^, w satisfies W2 for k = ni + K2- 

Suppose that I7I < niand \S\ < K2- Since Gi € C^'^'^^\ and 
/ D'^Gi {x — y) G2 {y) dy is absolutely convergent, we have 

(Gi * G2) = J D^Gi {x - y) G2 {y) dy = j D^Gi {z) G2 {x - z) dz. 

Since G2 G G*^^**^' and / D^Gi {z) D^G2 {x — z) dz is absolutely convergent we have our result. ■ 
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1.5 The Riesz representer of the evaluation functionals / (D'^f) (x) 

It was noted in Remark 16 that is a reproducing kernel Hilbert space. However, we will use the 
Riesz representer of the evaluation functional f ^ f {x) instead of the reproducing kernel because the 
former can be easily extended to the evaluation of derivatives. Thus in this chapter we calculate the 
Riesz representers of the evaluation functionals / {D"f) {x) and derive some of their properties. In 
particular, in part 6 of the next result we show that the inclusion C C^g ^ established in Theorem 14 

is continuous when is endowed with the usual supremum norm. The results of this section will find 
use in applications where, for example, higher degrees of differentiability of a smoother are required. The 
main tools used here are the inverse Fourier transform formulas of Theorem 24. 

Theorem 34 Suppose the weight function w has property W2 for some k. Then: 

1. The unique Riesz representer Rx € X° of the evaluation functional f ^ f [x) is 

Rx{z) = {2-k)~'^''^ G{z - x). (1.37) 

2. If \a\ < K then D^R^ G and the Riesz representer for the evaluation functional f {D°'f) (x) 
is {-D)"Rx. 

3. If \a\ < 2k then xi-x' implies D^R^ ^ D^R^'. 
4- If \a\ < K and |/3| < k then 

{D-Rx,D''Ry)^^^ ^ i-lf^ {D"+f'R,) (y) = {-1^+^^ {D-Rx,D''Ry)^^^. (1.38) 

5. If 111 < 2k then 

{D^R^} (y) = (-1)1^1 (D-fRy) (x). (1.39) 



6. max \\Dl^f\\^ < {2^)-"" max ^(-1)1^1 D'^PG (0) \\f\\^^„ f € X^ 



l/3|<«" l/3|<« 

Proof. Part 1. Prom 1.37 

^(0 = (27r)-'^/'e-^^«G(0 (1.40) 
= (2.)-/^-^. (1.41) 

By Theorem 24, G G X° and so 1.40 implies R^ G X^. Further, using 1.41 and then the inverse 
Fourier transform result 1.27 for functions in X^ we obtain 

(/- ^x)^,o = / «^/^ = i'^^r"'^ J e'"^/ (0 d^ = f{x). 
Part 2. If \a\ < k then 5^ = {i^f R^ = {2n)~'^/^ ^'^^^{q"^^ so that 

^ (0 (2vr)-* ^""l'^^ < {2n)-' J < oo, 

by weight function property W2. Further, for / e X^ and using the inverse Fourier transform result 
1.27 for functions in X^ gives 



(27r)-« 1 e-"« {liffiOd^ 
(27r)-'^/^y e-'^«D«f (O^C 
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Part 3. Suppose for some x ^ x' we have Rx — Rx'- Taking the Fourier transform and substituting 
G = ^ we must have e~'^^ = e~'^^' or e'^(^~^ ) = 1 for almost all ^, a contradiction. 

Part 4. Since we know that G e C^"^"-^ the definition of Rx implies Rx € C^'^"-^ and 

5^ (0 = (27r)^ (-i)'"' e--€rG (0 = (27r)^ (-i)'"' I"! < (1-42) 

Next note that Theorem 24 implies that for all x, and D'^Rx G X° when |7| < k, so the first and last 
terms of 1.38 make sense. Now using equation 1.42 

{D-Rx,Df<Ry)^^^ = J wD^xD^y = {-lf^ {2n)-'J {-i^f+^^^^^d^ 



w{0 

= i-lf^ (277)-"^ y {-iO"^^e'^y-^^^Gi^)d^ 
= (-l)l''l (27r)-'^ {D'^+f'G){y-x), 



where the last step used the inverse Fourier transform rule 1.31 for basis functions. Finally, substituting 
1.37 gives {D'-Rx,DPRy)^^^ = {-if^ (£>«+^i?.) {y}- 
Regarding the second equation: 

D^x (0 = D-gT^ x) (0 = (-1)1"! {D-GUT- ar) (0 = (-1)1"' e--«^ (0 

= (-1)1"! x) (0 

= (-1)1"! 5^(0, 

so that 

{D'^Rx,D^Ry)^^ = (-l)l«+'^l {D"Rx,D^Ry)^^^. 

Part 5. If I7I < 2k then 7 = a + /3 for some a and /3 satisfying |a| < k and |/3| < k. Applying part 3 
twice we get 



D^Rx (y) = D"+^Rx (y) = {-if^ {D'^Rx, D^Ry)^^^ = {-if^ {DPR,, i?-i?.,.),„,o 

= (-l)l«+^l D^+^Ry{x) 
= {-lp DiRyix). 



f,{-DfRx) ^ 



when 1/31 < /t so that 



Part 6 From part 2, = sup 

< sup ||£>'3ii.||^ = sup J{D0Rx,D0Rx)^^„\\f\l^, 

= sup ^(-1)1^1 {D^PRx)ix) ||/L,o- 
But from part 1, (D^f^Rx) {x) = (27r)"^ D'^'^G{0) and thus 

ll^^'^/L < i^^r'^'^i-lf^D^PGiO) ||/|L^o> 
which implies this part. ■ 
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1.6 More continuity properties of the data functions 

The Ricsz representors of the cvahiation functionals / f {x) discussed in the last section can be used 

to prove some local, pointwise smoothness properties of the data functions and their derivatives e.g. 
Lipschitz continuity. The basis function G generated by w also lies in and is considered separately. In 
part 2 these local properties will be used to derive the order of convergence of the variational interpolant 
to its data function. 



1.6.1 General results 



In Remark 38 we observe that the right sides of all the estimates derived in this subsection can be written 
in terms of even order derivatives of the basis function evaluated at the origin. 

Theorem 35 Suppose w is a weight function satisfying property W2 for parameter k, G is the basis 
function and Rx is the Riesz representer of the evaluation functional f ^ f (x) on X^. Then for \a\ < k 



(2;r)I 



^J{-lp {D^-G (0) - Re {D^-G) {y - x)). 



Further, if k > 1 and |a| < k — 1 then we have the bound 



(2u)7 



w{0 



dn \x-y\ 



where 



J ^^^d^ = (-1)1+1"! (27r)* (D^^V^G) (0) . 



Proof. If I a I < K then from the results of Theorem 34 

WD'^Rx-D'^RyWl^^ 

= {D^Rx — D^Ry, D^Rx — D"Ry)^^Q 

= {D"Rx,D"Rx)^^, - {D<^Rx,D<^Ry)^ , - {D^Ry, D'^Rx)^^,+ 

+ {D'^Ry,D'^Ry)^, 

= (-l)l«l [{D'"Rx) (x) - {D'"Rx) {y)-W^Rxny) + (D'^Ry) (y)) 
= (-1)1"! {2Tryi [d^^G (0) - {D^'^G) {y - x) - {D^^G) {y - x) + D^"G (0)) 
= (-1)1"! {2Tr)-i (^D^"G (0) - {D^"G) {y - x) - {D^»G) {y - x) + D^-^G (0)) 
= (-1)1"! 2 (27r)-^ (£)2°G (0) - Re {D^"G) {y - x)) . 



The proof of our second result uses equation 1.42 i.e. D°'Rx (^) = 



(277)2 



so that 



\D°'Rx - -D"-Ry||^ - j 



w 



D^'Rx - D"R^ 



1 2 



2 sin 



2 



w{0 ^ 



(1.43) 



(1.44) 



(1.45) 



< 



dU\x-y\ , 



which is finite since 1 + |q:| < k. Continuing 

/ ^-^d^ = J l^l' ^ (0 = (-1)'+'"' J D^^G = (27r)^ (-l)i+l«l (£)2av2G) (0) . 
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We now prove some uniform pointwise estimates for functions in X^. 

Corollary 36 Suppose the weight function w has property W2 for parameter k and that G is the basis 
function of order 0. Then we have the following local pointwise estimates for functions f G X^: 

1. If K>1 and \a\ < k — 1 then 



\D'^f{x)-DV{y)\<-^-^\\f\\^, 

(2ir) 2 



w{0 



-d^] \x-y\. 



2. If \a\ < K then 

ID'^f {x) - D"f (y)| < ||/IL,o V (-1)'"' (D^'^G (0) - Re {D^-G) {y - x)). 

3. For the hat weight function in one dimension k < 1/2 and 

|/(ar)-/(y)|<-^||/IL,ok-yr, ^,2/eK\ \x-y\<l. 

Proof. Part 1 This is an application of inequality 1.44 of Theorem 35 

\D'^f{x)-DV{y)\ = \{f,D'^R^-D'^Ry)^^^\ < \\f\\^^, \\D'^R, - D'^Ry\\^ ,^ 



< 



T 

(27r)^ 



(1.46) 



w{0 



da \x-y\. 



Part 2 By part 2 of Theorem 34 

\D-f (x) - 2?"/ (y) I = I (/, - I?"E,)^_o 

and inequality 1.43 of Corollary 35 completes the proof. 



< 



WD'^R^-D'^RyW^^^, 



Part 3 Prom Theorem 6 k < l/2soa = and since G{x) = 1 — \x\ when \x\ < 1 our result follows 
from the bound proved in part 2. ■ 

The next corollary provides information concerning the smoothness properties of basis functions. 

Corollary 37 Suppose G is the basis function generated by a weight function w satisfying property W2 
for parameter k. Then for each e M"^, G has the following properties: 

1- If \oi\ < K and \P\ < K then 



\D°'+>^G{x) - D°'+^G{y)\ < k^J (-l)'"' {D'^^G (0) - Re (£)2«G) {x - y)), (1.47) 



where 



2. If \a\ < K then 



and 



'^^il'-^y^ (0). 

\D^"G{x) \ < (-1)1"! D^"G{0), 
(-l)l"lRe£>2«G(a;) < (-l)'"' iP^^G (0) , x^O. 



(1.48) 
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3. Suppose K>1. Then if\a\ < n — 1 and \ j3\ < n — 1 

Proof. Part 1. Our starting point is equation 1.38 i.e. for x,y & W^, 

{D'^R^,Dl^Ry)^ Q = (-l)l'^l {D°'+PR^) [y) when \a\ < k, \f3\ < k. Hence, since 

R,{y) = {2w)-^ G{y-x) 

D^+^ {G{z-x)-G{z- y)) = {2n)-^ (i?, - Ry) (z) 

= {27r)-^ {R^-Ry),Df'R, 



\x-y\- 



w.O ■ 



SO that 

\D'^+f' {G{z-x)-G{z- y))\ < (27r)-^ \\D'^ [R, - Ry)\\^ , \\D^R.\L o • (^-^Q) 



\w,0 ' 

From equation 1.42 wc conclude that HZJ^^iJ^H^ ^ = (27r)~^ ^ / ^w{(,) ) ^ then using equation 1.43 
for ||L>" {R^ - Ry)\\^^Q incquaUty 1.47 foUows. 

Part 2. We have (-1)'"' D'^'^G{x) = (27r)"^ / e*«^^rfC, so that 

|l?'"G(x)| < (27r)-* J ^|^de= (-1)'"'d'"G(0). 

Prom part 1 we know that (-1)'"' Re {D'^"G) {x - y) < (-1)'"' £>^"G (0). Now suppose that 

(-1)'"'Rc (ZJ^^G) (z) = (-1)'"' i:>2«G(0) for some z 7^ 0. Equation 1.43 would then imply that 

D'^Rx = D'^Rz+x for all .t, and hence z = by part 3 of Theorem 34. 

Part 3. Our starting point is inequality 1.49 of part 1. Substitute the expression for ^ given 

in the proof of part 1 and then use inequality 1.44 to estimate \\D°' {R^ — Ry)\\^ q- ■ 

Remark 38 Observe that by using 1.45 and I.48 the right sides of all the estimates involving weight 
functions in this subsection can be written in terms of even order derivatives of the basis function eval- 
uated at the origin. 

1.6.2 Better results obtained using Taylor series 



We can improve on part 3 of Corollary 37 for the special case k > 1, y = and a = /? = 0; improved in 
the sense that |a;| is replaced by |a;| and this is done by using the Taylor series expansion with integral 
remainder given in Subsection A. 7 of the Appendix: 



Theorem 39 Suppose the weight function w satisfies property W2 for k = 1. Then (V^G) (0) is real 
and negative and we have the bound 

G(0)-ReG(a;) < -^(V^G) (0)|a;|\ x&W^. (1.50) 

Now suppose w is radial and let r = \x\. Then: 

1. IfG{x) =g{r) then g e G^^^ ([0,oo)) and r-^g' G ([0,oo)). Also 

(V2G)(0)=5"(0)rf. (1.51) 

2. IfG{x) = f (r2) then f € G^^) ((0, 00)) n G^j^^ ([0,oo)) and rf" (r) € ([0,oo)) and 

hm rf" (r) = 0. Also 

(V2G)(0) = 2/'(0)rf. (1.52) 



34 1. Weight functions, data spaces and basis functions 

Proof. We start by assuming w is an even function. Because k = 1 it follows that G G C^^"^'* C C^K 
Also, since 

\0 



D^G (x) = {2n)-i J e--«^dC, < 2, 



and w is even, it follows that G is real, \G {x)\ < G (0) and D^G (0) = for all k. Thus from Appendix 
A. 7 we now have the second order Taylor scries expansion 

G (x) = G (0) + (7^2G) {0,x), 

with remainder estimate 

/ 



|(7^2G)(0,x)|<^ 



max 

1/31=2 
\*6[0,a;] 



But when \(3\ = 2, D^^G {x) = - (27r)"3 / e-'^«^d^ and hence 

\D^G{x)\ < {2ny^ I = {2n)-^ J J^d^ = - (V^G) (0) . 



so that 

G{0)-G{x)<~{W^G){0)\xf , x€W^. (1.53) 
Now set aside the assumption that w is even. We define the even function Wg by 

and it is easy to show that Wg has property Wl and satisfies 

/ -^d^ = I ^T^d^^ < A < (1.55) 

so that We has property W2 for k. Further, if We has basis function Gg then Gg = i (g + g) and 

hence Ge = ReG. But Gg satisfies 1.53 and since 1.55 implies (V^Gg) (0) = (V^G) (0) we can conclude 
that 1.50 is true. 



Part 1 Suppose G{x) = g (r). Then substituting x = (r, 0') we have 

g{r)=G{r,0'), 

so that it is clear that g e C^^ ([0, 00)) since G G Regarding r~^g', it is clear that r~^g' e 

gL^^ ((0,00)). Further, since g' e C^2^ ([0,oo)), lim sllll = g" (0) and lim ^ = which means that 

r— >0+ r— >(X) 

r-VeClj'^([0,oo)). 
If X ^0 

DkG{x) = ^g'{r), 

and 

DjD.G (x) = ^g" (r) - ^g' (r) + ^g' (r) , 

so that 

V^G {X) = X: DlG (x) = (r) - ^ + = g" (r) + (d - 1) 



fe=i 



But e G^^^ ([0,00)) so 



lim_ V^G (x) = / (0) + (d - 1) g" (0) = 5" (0) d. 

X— >0 



1.6 More continuity properties of the data functions 
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Part 2 Suppose G [x) = f (r^) . Then the equation / (r) = g (\/r) and the properties of / proved in 
part 1 can be used to easily derive the results of this part. ■ 

For radial basis functions the estimates the last theorem for G (0) — Re G (a;) give a dependency. 
However, if the basis function is a radial function this dependency can be avoided: 

Theorem 40 Suppose the weight function w satisfies property W2 for k = 1 and denote the basis 
function by G. Then: 

1. If G (x) = f {r'^) then 

G(0)-ReG(a;)< ||2r/" + /'||^|:r|% xgR". (1.56) 



2. IfG{x) =g{r) then 



GiO)-ReGix)<^\\g"\\^\x\\ a; e (1.57) 



«;ftere||2r/" + /'||^ = i||5"IU. 

Proof. We will start by assuming w is an even function. Because k > 1 it follows that G e Cg'^'^'^^ C C^ K 
Also, since 



Df'G {x) = (27r)-« j e— < 2 



it follows that if w is even then G is real, |G(x)| < G(0) and DkG {0) = for all k. Thus from 
Appendix A. 7 we now have the Taylor series expansion 

G (x) = G {0) + {■R2G) {0,x), 

where 



(TeaG) (0, x) = 2 V — / (l-t) {D^G) {tx) dt. (1.58) 

1/31=2 J° 



Part 1 1 Suppose G{x) = f (r^). Then 

DkG {x) = 2xkf' (r^) , 

and 

D.DkG (x) - 4x,Xkf" (r^) + 26,,kf' (r^) . 
Set 5/} = 52,max/3- Then in multi-index notation 

Dl'G {x) = Ax^f" (r^) + 2dpf' {r') , |/3| = 2, 
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and so, using the identity A.l, the remainder can be written 

r/3 fl 



{n^G) {0,x) = 2j2^ f (l-t) {4x^f" (r^)) (te) dt+ 

|/3|=2 

+ 2E ^ fa-t){2Spf'ir')){tx)dt 

|/3|=2 ^' 

|/3|=2 



The remainder can be estimated as 



= (1 - t) er dt + 2r2 (1 - t) f (fr^) dt 

Jo Jo 

C (1 - t) {U\^f" (iV^) + 2/' (iV^)) dt. (1.59) 
Jo 



\{n2G) (0, < C (1 - i) |4iVV" (iV^) + 2/' (tV^) | dt 
Jo 

= r2||4r/" + 2/'|U [\l - t) dt 
Jo 

= Prf" + f\\ooM'- (1-60) 

so that we have proved the desired estimate 1.56 when w is an even function. This estimate can now be 
extended to an arbitrary weight function by the technique used in Theorem 39 which involved defining 
the even weight function 1.54. 



Part 2 I Suppose G{x) = g (r). Then g[r) = f (r^) and 

5"(r) = 2/'(r2)+4rY'(r-^), (1-61) 
so the remainder equation 1.59 can be written 

(7^2G) (0, x) = (1 - t) {U\^f" (tV^) + 2/' (tV^)) dt = (1 - t) g" (tr) dt, 



and estimated by 



|(7^2G)(0,a:)|<-||.g"|lool^^ (1-62) 



Equation 1.61 ensures that the estimates 1.60 and 1.62 are equal. 

We have now proved the desired estimate 1.57 when w is an even function. This estimate can now be 
extended to an arbitrary weight function by the technique of Theorem 39 which involved defining the 
even weight function 1.54. ■ 

Remark 41 Calculations using part 1 may he easier when the basis function depends on r^: see the Type 
1 convergence estimates for the radial functions of Subsubsection 2.5.2. 
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Inequality 1.47 shows that the smoothness of the basis function near the origin imphes its global 
smoothness. Specifically, for the one-dimensional hat function A we have k < 1/2 and near the origin 
|A(x) — A(7/)| < \/2\x — y\^^'^ by part 3 of Corollary 36. However, A actually satisfies the stronger 
estimate |A {x) — h.{y)\ < \x — y\ everywhere i.e. it is uniformly Lipschitz continuous on and Theorem 
43 will generalize this result. First we will need the following extension to distributions of the Taylor 
series expansion with integral remainder given in Appendix A. 7. 



Lemma 42 Suppose u e ^ 
Then 



and the distributional derivatives {ZJ'^'u}, ^.^ are hounded functions. 



u{z + h) = u{z) + (TZiu) {z, h) . 
where TZiu is the integral remainder term 



{n,u){z,b)=Y. {D^u){z+{l-t) 



b)dt, 



which satisfies 



l/3|=l 



|(7^lu) {z,h)\ < V^max \b\ . 



(1.63) 



Proof. In order to overcome the fact that D'^u may not be C(°) (K'*) when |/3| = 1, we will use a Taylor 
series expansion with remainder for distributions. Suppose (j) G Cg^ . Then the conditions on u render 
all the integrals absolutely convergent and allows us to apply Fubini's theorem to swap the order of 
integration in the calculations of this lemma: 



[u{z + b),<P{z)] = [u{z),<P{z-b)] 



W'E T I't'^iDU) {z + {l-t){-b)) 
1/31=1 



dt 



= [u(z),<^(z)]- ^ -^«(z),y^ {D'',l>){z-{l-t)b)dt 



(1.64) 



l/3|=l 

We now analyze the integral remainder term of 1.64. When \/3\ = 1 

'.{z), j {Df^cf)) {z-{l-t)b)dt = [u (z) [ r {Df^cf) {z-{l- t) b) dt dz 
Jo i J Jo 

= j u{z) {Dl^<t)) {z-{l- t) h) dz dt 

= C [u{z),{D^4'){z-{l-t)b)\ dt 
Jo 

= [ [u{z + {l-t)b),D'^4>{z)] dt 
Jo 

= (-1)1^1 [\{D^u)iz + {l-t)b),cP{z)] dt 
Jo 

= - j (D^u) {z + {l- t) h) (/) (z) dz dt 

= - J [D^u) (^ + (1 - t) b) dt (p {z) dz 

I {D^u){z+{l-t)b)dt,(l){z) \ , 
Jo 



so 1.64 now becomes 



[u{z + b),<t> {z)] = [u {z) , <i> {z)] - ^ - {D^u) {z + {l- t) b) dt, ct> {z 



l/3|=l 
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for (/> e C^. Thus 

u{z + b)=u{z)+ 1^ (^^w) + (1 - t) b) dt, 

|/3|=1 ^' "'O 

as claimed. Finally, the estimate 1.63 follows directly from the integral remainder estimate A. 3 of the 
Appendix. ■ 

We now prove that all derivatives of the extended B-spline tensor product basis function up to order 
2n — 2 are uniformly Lipschitz continuous of order 1. 

d 

Theorem 43 Let G {x) = Y\ Gi (xk) be an extended B-spline tensor product basis function, as intro- 

k=l 

duced in Subsection 1.4-4- We then ha,ve the estimates 

\G{x)-G{y)\<VdG,{0f-'\\DG,\\^\x-y\, x,yeR^, 

and if \a\ < 2n — 2 

|£>«G(ar)-£>«G(2/)| < VdxAx\\DkD"G\\^\x - y\ . (1.65) 

k—1 

Proof. Prom Theorem 29 we know that G e C^^""^^ (R^) and that the derivatives {D'^G}\^\^2n-i 

bounded functions. Consequently G G Cq^^ (K'') and all the first derivatives are bounded functions. This 
implies u = G satisfies the conditions of Lemma 42 with z — y and b = x — y so the estimate of that 
lemma holds i.e. 

\G{x)-G{y)\ < \/dmax\\D^G\\^\x-y\. (1.66) 
Again by Theorem 29, d e C^°^ (R^) and DGi is a bounded function so that 

\G (x) -G{y)\<Vd \\G^f-' \\DG,\\^ \x-y\. 

Finally, part 2 of Corollary 37 tells us that HGiH^ < Gi (0) which completes the proof of the first 
inequality. To prove the second inequality we simply replace G by D°'G in 1.66. ■ 



2 

The minimal norm interpolant 



2.1 Introduction 

In Chapter 1 of this document we introduced the basic mathematical machinery: weight function, data 
space and basis function. In this chapter this theory is applied to the well-known minimal norm interpo- 
lation problem. This problem is solved and several pointwise convergence results are derived. These are 
illustrated with numerical results obtained using several extended B-spline basis functions and special 
classes of data functions. 

In more detail, the functions from the Hilbcrt data space X'^ are used to define the standard minimal 

norm interpolation problem with independent data X = {a;^*-*}^^ and dependent data y = {Vij^^i, the 
latter being obtained by evaluating a data function at X. This problem is then shown to have a unique 

N 

basis function solution of the form ^ a^G (• — a;^*') with a, G C. The standard matrix equation for the 
ai is then derived. 

We will then consider several categories of estimates for the pointwise convergence of the interpolant 
to its data function when the data is confined to a bounded data region. In all these results the order of 
convergence appears as the power of the radius of the largest ball in the data region that can be fitted 
between the X data points. The convergence results of this document can be divided into those which 
use Lagrange interpolation theory, and thus assume unisolvent data X (Definition 66), and those which 
do not. The non-unisolvency proofs are much simpler than the unisolvency-based results and if the data 
functions are chosen appropriately the constants can be calculated. However, the maximum order of 
convergence obtained is 1, no matter how large the parameter k. On the other hand, the constants for 
the unisolvency-based results are more difficult to calculate but the order of convergence is equal to the 
value of max \_k\ . 



Non-unisolvent data: Type 1 interpolation error estimates Suppose that the weight function has prop 



erty W2 for some k > 0, that the data X is contained in a closed, bounded, infinite data set K, and that 
the basis function G satisfies the estimate 

|G(0) - Re G(x) I <Cg \x\<hG, 

for some /ig > 0. Then in Theorem 59 it is shown that the interpolant Ixf of a data function / e X'^ 

satisfies 

1/ {X) - Ixf ix)\ < kc ll/IL n ihx,Kr , XGK, 



when hx K = maxdist {x,X) < he and kc = (Stt) * ^2Cg- This implies an order of convergence of 
at least s. 
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2. The minimal norm interpolant 



Non-unisolvent data; Type 2 interpolation error estimates Here we avoid making any assumptions 



about G and instead assume k> 1 and use the properties of the Riesz representer of the evaluation 
functional / — > / (a;). A consequence of this approach is the estimate (Theorem 65): when hx,K < oo 

1/ {x) - Ixf {x)\ < kc ||/IL,o hx,K, xeK, fexl 

where 

ka = (27r)-''/^ V- (V^G) (0). 
so the order of convergence is cdways at least 1. 



Unisolvent data - interpolation error estimates 



Here X is assumed to have a unisolvent subset of 
order max [k\ and is contained in an open, bounded data region SI. In this case it follows from Theorem 
74 that there exist constants hn,K, kc > such that 

\f{x)-Ixfix)\<kG\\fL^oihx,n)^''^ , xgU, fGXl 
when = sup dist < h^^K- 

Numerical results are only presented for the non-unisolvent data cases and these illustrate 
the convergence of the interpolant to it's data function. We will only be interested in the convergence of 
the interpolant to it's data function and not in the algorithm's performance as an interpolant. Only the 
extended B-splines will be considered and we will also restrict ourselves to one dimension so that the 
data density parameter hx^n can be easily calculated. If we can calciilatc the data function norm we can 
calculate the error estimate. All the extended B-spline basis weight functions have a power of sin^ x in the 
denominator and so we have derived special classes of data functions for which the data function norm 
can be calculated. The derivation of these special classes of data functions led to the characterization of 
the restriction spaces X° (fi) for various class of weight function. For example, Theorem 90 shows that 
the restrictions of the B-spline data functions are a member of the class of Sobolev-like spaces: 

(n) = {u€L^ {n) : D^u G (fi) forai<m,Q<i<m], m = 1, 2, 3, . . . 

It is easy to construct functions that arc in these spaces. 

As expected interpolant instability is evident and because our error estimates assume an infinite 
precision we filter the error to remove spikes which are a manifestation of the instability. 



2.2 The space Wg,x 

In this section wc will define the space Wq.x which contains the solution to the minimal norm interpo- 
lation problem 2.4. Here X is the independent data and G is the basis function. The next two results 
will be required to show that Wg,x is a well defined finite dimensional vector space. The next theorem 
requires the following lemma which we state without proof. 

Lemma 44 Suppose {a;^*^^}^^ is a set of distinct points in and v = {i'k)k=i ^ ■ Define the 
function a„ by 

fe=i 

Then: 

1. if (^) = for all ^ then Vk = for all k. 

2. The null space of a„ is a closed set of measure zero. 

Theorem 45 Let X = jx'^'^^j^j^ be a set of distinct points in . Then the set of translated basis 
functions {G (• — x'^^^)^^^^ is linearly independent with respect to the complex scalars. 
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Proof. Suppose for complex a^, ^ a^G {x — x^''^) = 0. Taking the Fourier transform we obtain 

k=l 

1 ^ lb 

. 

Since w (^) > a.e. we have Yl QifeC*^ ^ = and part 1 of Lemma 44 implies a/j = for all fc. ■ 

fe=i 

We now introduce the space Wg,x- 

Definition 46 The finite dimensional vector space Wg,x 

Suppose the weight function w has property W2 for parameter k, let G he the (continuous) basis 

function generated by w and let X ~ jx*^*^^ } ^^ ^^ be a set of distinct points in W^. We have shown in 
Theorem 45 that the functions G (• — x*^*^-*) are linearly independent so the span of these functions makes 
sense. We define the N— dimensional vector space Wg,x by 

^G,x = |$^afcG(--xW) :afcec|. (2.1) 

When convenient below, functions in Wg,x will be written in the form 

N 

foe {x) = E oikG {x - xC^)) where a = (ofe) e C^. 
fe=i 



2.3 The matrices Gx,x and Rx,x 

We will now define the basis function matrix Gx,x and the reproducing kernel matrix Rx,x- In this 
document we deal with basis functions of order zero so we will deduce the simple relationship Rx,x = 
(27r)^'^^^ Gx,x- These matrices will be used to construct the matrix equations for the interpolants studied 
in this document and the smoothers studied in Chapters 4 and 5. 

Definition 47 The basis function matrix Gx.x 

Let X = be a set of distinct points in W'- and suppose G is the basis function generated by 

w. Then the basis function matrix Gx,x is defined by 

Gx,x=(g (x«-x«)). 

Remark 48 Since G = {^)^ we have G (— x) = G (x) and so the matrix Gx,x is Hermitian. 

Definition 49 The reproducing kernel matrix Rx,x 

Suppose Rx is the Riesz representer of the evaluation functional f ^ f (x) introduced in Theorem 34- 

Suppose that X = {a;^'^^}^_^ is a set of distinct points in W^. Then the reproducing kernel matrix is 

Rx,x = (i?^o)(x(*))) . 

/ call this the reproducing kernel matrix because the reproducing kernel K {x, y) satisfies K (x, y) = 

Re- 
using a separate symbol for the matrix Rx,x fits in with the case of positive order, where Rx,x is not 
simply a scalar multiple of Gx,x but is related by a more complex formula. 

The reproducing kernel matrix Rx,x has the following properties: 

Theorem 50 The reproducing kernel matrix Rx,x has the following properties: 



1. Rx,x = {2tt)-''/^Gx,x- 
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^- Rx,x is a Gram matrix and hence positive definite, Hermitian and regular. 
3. The functions {-Rx('=)}feLi independent. 
Proof. Part 1 True since in Theorem 34 it was shown that Rx{z) = {2Tr)~'^^^ G{z — x). 

Part 2 Rx.x = {Rxir){x^^^)) ■ But from the definition of R^, R^(j){x^^^) = {RxU) ^ Rx<-''>) w o ^° Rx,x is 
a Gram matrix and hence is positive definite over C, Hermitian and regular. 

Part 3 In Theorem 45 it was shown that the functions G{z — x'^'^^) are independent and since Rx{z) = 
(27r)~''^^ G{z — x) the functions R^m are independent. ■ 



2.4 The vector- valued evaluation operator Sx 

The vector-vahied evahiation operator £x and its Hilbert space adjoint are the fundamental operators 
used to solve the variational interpolation and smoothing problems. 

Definition 51 The vector-valued evaluation operator £x 

Let X = be a set of N distinct points in M.'^. Let u be a complex-valued continuous function. 

Then the evaluation operator £x is defined by 



~Exu={u{xi^))l^. 



Sometimes we will use the notation ux for £xu and when dealing with matrices £xu will he regarded 
as a column vector. 



Theorem 52 Let X = be a set of distinct points in and suppose that the weight function 

w has property W2. Then we know that X^ is a reproducing kernel Hilbert space of continuous functions 
and the evaluation functional f f(x) has a Riesz representer, say Rx. The evaluation operator £x 
will be now shown to have the following properties: 



1. £x ■ i^X^, ll'll^ qJ — > (C^, |-|) is continuous, onto and null^x = Wq x- 

2. The adjoint operator E^ '■ defined by {£xf,g^^^ = (^f,£x9^ , satisfies 

N 



i=l 



and is a homeomorphism from [C^ , |-|) to (Wg,x, |M|^ q) • 



3. Suppose Rx,x = (-RxW) (^^'^)) ^^^^ reproducing kernel matrix and ||-|| is the matrix norm corre- 
sponding to the Euclidean vector norm. Then 



X 



£x 



= \\Rx,x\\ < ^VRo (0). 



4. The operator £x£x is self-adjoint and we have £x^x = \\Rx,x\(^ , as well as the formulas 



£*x£xf =T.fU'^)Rxi^), f^Xl (2.2) 



and 



[£*x£xf) {x) = (£xf, ^xRx)^^ = (/, ^x^xRx)^ ^ , f&Xl (2.3) 

Also, £x^x ■ X'^^Wg,x is onto and im\\£x£x = Wq x- 
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5. The operator £x£x self-adjoint and if we regard the range of £x as column vectors 

£xi*xf3 = Rx,xf3, /?eC^, 
where Rx,x is the (regular) reproducing kernel matrix introduced in Definition 49. 



6. IfX = {xW}^^ and Y = {y^'^}"^^ define Rx,y = {RyU) (x^^^)) and Gx.y = (G(x« 

so that Rx,Y — (27r)~^ Gx,y- Now when Ey is assumed to be a column vector we have ExSyP = 
Rx,yP when l3 eC^' . 

Proof. Parts 1 and 2 That £x is continuous follows from the Cauchy-Schwartz inequality and 



, N' 



£xu 



i=l i=l \i=l / 



w.O • 



Clearly £xu = iS {u, R^{i})^ ^ = for each R^{i} so that null fx = x- 

Next we show that £x is onto. The Hilbert space adjoint of £x is denoted E^ and is defined by 

{£xu,p)^^ = {u,£*xP)^^^. 

The adjoint is calculated using the representer R^: 

i=l 1=1 \ i=l / w.O 



SO that 



£*xf3 = Y.f3iR.i^u /?eC^. 



i=l 



N 



Finally we show that E^ is 1-1. But E^P = implies ^ PiRxd) = and, since R^a) = G (• — a;^''), 
Theorem 45 implies the iJ^-ci) are linearly independent and so /? = 0. 



Part 3 That 



S*x0 



X 



W.O 



X 



, i=l 



is an elementary property of the adjoint. Now 

op 

N \ N 

E PiPj (-Rx(i) > RxU) )w,0 

,j=l 

JV 

= E (^^'^) = P^Rx,xP, 



N N 

y^_^PiRx(i),y^j3jR,^ 



w,0 



SO that, 



= max " 



= -^^^ latter expression is the largest (positive) 



eigenvalue of the Hermitian matrix Rx,x i-e. the value of ||-Rx,x||- Hence 



X 



\R 



x,x\ 



£xf\' = f2\f {^^'^) f = E |(/, i?.<^)).,of < E I 



IL,0 ll-^xC') IL.O 



fe=l 



fe=l 



fe=l 

JV 



Ell/ll-,o^o(0) = A^iio(0) 



IL,o • 



fe=i 
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Part 4 The fact that Sx^x is self-adjoint and the formulas 2.2 and 2.3 arc simple consequences of the 

2 



definition of and Rx. That 



^*x^x 



-x 



is an elementary Hilbert space result and part 3 now 



implies 



£*x£x 



\Rx. 



x\ 



From Definition 46 we know that the functions {R^ii) form a basis for Wg,x, where the a;'*' are the 
unique points of X. Hence range = Wg,x and null 5^ = {0}. Now recall the closed-range theorem, 
see for example Yosida [14], p205, which states that for a continuous linear operator V, the range of V 
is closed iff the range of V* is closed. Since range £^ = Wg,x and Wg,x is finite dimensional, range 
is closed and so rangefx is closed. Consequently, using the result that rangeV = (nullV*)"'" for any 
continuous linear operator V, it follows that 



range fx = range fx = (^nuUf^^ = {0}^ = 



Part 5 

N N 

SxSxP = Sx^^PiRx(i) = ^^£xRx(')Pi = [SxRxW,- ■ .,£xRxW^ P = Rx,x0, 
i=l i=l 

since Rx,x = (RxU) (x^'^)). 

Part 6 The proof is very similar to that of part 5. ■ 



2.5 Minimal norm interpolation 

2.5.1 The minimal norm interpolant and its solution 

In this subsection we will formulate and solve the minimal norm interpolation problem for scattered 
data. The solution to this problem will be shown to be unique and its form derived. 

Before defining the interpolation problem we will introduce some notation for the data. We assume 
the data is scattered i.e. irregularly spaced, as distinct from on a regular rectangular grid. The data is a 
set of distinct points {{^x^^\yi ) }^^^ with e R'' and yi G C. We assume the x*^*^ must be distinct and 

set X = {a;^'^}^^ and y — {yi}fLi- In this document X will be called the independent data and y the 
dependent data. 

The minimal norm interpolation problem 

Suppose {{x'"^\yi)}'^^i is scattered data, where the x^'^ are distinct. 
We say uj e is a solution to the minimal norm interpolation 
problem if it interpolates the data and satisfies q < q 

for any other interpolant u € X^ . 



Using Hilbert space techniques we will now do the following: 

1. Show there exists a unique minimal norm interpolant. 

2. Show the interpolant is a basis function interpolant i.e. show that it lies in Wg,x- 

3. Construct a matrix equation for the coefficients of the data-translated basis functions G (• — x^^^). 

The next theorem proves the minimal norm interpolation problem has a unique solution. 

Theorem 53 For given data there exists a unique minimal norm interpolant Ui. If v is any other 
interpolant satisfying £xv = y then 

IKC + lb-«/C = II^C, (2-5) 

or equivalently 

{v-ui,ui)^^o = 0. (2.6) 



(2.4) 
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Proof. Since the evaluation operator £x '■ is continuous and onto, and the singleton set {y} 

is closed, it follows that the set 

is a non-empty, proper, closed subspace of the Hilbert space X^. Hence this subspace contains a unique 
element of smallest norm, say u/. If £xv = y then 

|u : M e and £xu = = |u — s : s G null fx | • 



Now 



linjll?; - s\\^Q : s € nuUfjfj = dist ^i;, null fx) , 



is the distance between v and the closed subspace null fx- Therefore there exists a unique sj G null fx 
such that 

Ui = V — Si, (2.7) 
11^ - «/IL,o = minjll^^- s||^_o • « S nullfxj , 

and 

ll«/|l^,o + 11^ - ^i\\l,o = MI,o ■ (2-8) 

Substituting for sj in 2.8 using 2.7 yields 2.5. Equation 2.6 follows since it is a necessary and sufficient 
condition for 2.5 to be true. ■ 

Next wc prove that the minimal norm intcrpolant lies in the space Wg,x of Definition 46. 

Theorem 54 For each data vector y G the minimal norm intcrpolant uj is given by 

ui = e*xRx,xy = (27r)'/' £*x (Gx.x)-' y, (2.9) 
where ExR^x '■ ~^ Wg,x is an isomorphism. We also have 

ui {z) = (ixR^Y Rx^xV- (2-10) 

Proof. From part 5 of Theorem 52, fxfx — Rx.x- Now by Theorem 50 Rx,x is regular but in general 
Rx.x 7^ / so in general fxfx2/ V ^xV is not an intcrpolant. However, fx (^fx-^x^x) = s*-* 

S^Rx^xV is intcrpolant in Wg,x- For convenience set u = S^Rx^xV- want to show that u = ui. 
But ' ^ J 

{u, ui -u)^Q = (£xRx\y^ ui-u) = (Rx\y, fx (w/ - u)) = 0, 

V / wfi \ ' wfi 

since uj and u are both interpolants to y. Thus 

ll"/lll,o = - w + ufw.Q = II"/ - -"11^,0 + ll^^lllo ' 

since {u,ui — u)^q = 0. If m 7^ m/ then ||M||j„g < ||'"/|ltuO' contradicting the fact that u/ is the 
minimal intcrpolant. Since Rx,x is regular, R^x isomorphism from C''^ to C''^ and because f^ is 
an isomorphism from to Wg,x we have that S^Rx^x • ~* Wg,x is an isomorphism. 

Finally, using the definition of the adjoint £\ 

ui (z) = {ui, Rz)^fi = (SxRx^xV' ^^)^ = {Rx^xV' ^xRz) = {Sx'Rl) Rx^xV- 

■ 

The last theorem allows us to define the mapping between a data function and its corresponding 
intcrpolant. 



46 2. The minimal norm interpolant 

Definition 55 Data functions and the interpolant mapping Ix ■ Wg,x 

Given an independent data set X, we shall assume that each member of X^ can act as a legitimate 

data function f and generate the dependent data vector £x f ■ 

Equation 2.9 of Theorem 54 enables us to define the linear mapping Ix '■ X'^ Wg,x from the data 
functions to the corresponding unique minimal norm interpolant Ix f = uj given by 

Ixf = S*xR-x]x^xf, f&Xl. (2.11) 

Since Exixf = £-xf , Theorem 52 can he easily used to show thatlx is a self-adjoint projection onto 
Wg,x with null space Wq-^, and that Ixf = f iff f & Wg,x- Since f interpolates the data it follows 
from Theorem 53 that 

||/-X^/IL,o<ll/IUo> ll^x/IL,o<ll/L,o> f^Xl, (2.12) 
i.e. Ix and I — Ix are contractions. 

We now know that the interpolant lies in Wg,x and from Theorem 45 we know that {G (• — x^'^^)} is 
a basis for Wg,x- The next step is to derive a matrix equation for the coefficients of the G (• — x^^^): 

Theorem 56 The space Wg,x contains only one interpolant to any given independent data vector y € 
C^. This interpolant is the minimal norm interpolant and is defined uniquely by 

N 
k=l 

where the coefficient vector v = {vk) satisfies the regular basis matrix equation 

Gx,xv = y. (2.14) 

Proof. From Theorem 54 we know that for a given independent data vector y, Wg,x contains the 
minimal norm interpolant. Suppose Wg,x contains another interpolant vj. Then £x (w/ — vj) = and 
so ui — vi G null fx- But from part 1 of Theorem 52 null fx = Wqx- Hence uj — vj G Wq x ^^'^ 
ui = VI since ^ ^ ^G.x = {0}. 

Equation 2.13 and the interpolation requirement ui (x^'^^) = yk implies Gx,xv = y and by Theorem 
50 Gx,x is regular. ■ 

2.5.2 Error estimates derived without assuming unisolvent data sets 

In this subsection we will prove several pointwise error estimates, also known as convergence estimates, 
for the minimal norm interpolant Ixf to its data function / S X^ as the independent data X 'fills' a 

bounded, closed data region K. 

Unlike the results given by Light and Wayne for positive order in [6] and [7], the results of this 
subsection do not use Lagrange interpolation theory and so the proofs arc much simpler. The order of 
convergence appears as the power of the radius of the largest ball that can be fitted between the data 
points X. 

In this subsection we will consider two types of estimate which do not use unisolvent data sets. For 
want of better names we will just call them Type 1 and Type 2. In the case of Type 1 the weight function 
has property W2 for some k > and the basis function G satisfies an extra smoothness condition near the 
origin. For Type 2 estimates the weight function has property W2 for some k> 1 and we use pointwise 
estimates based on the Ricsz representer. 

In the next subsection we will derive an error result which improves on the result of Type 2 by using 
the Lagrange interpolation theory. These results require that the independent data points constitute a 
unisolvent set and assume that the data region satisfy a cone condition. An order of convergence of [kJ 
is then obtained. 

When deriving error estimates we hope that as the number of evaluation points X increases i.e. as 
the 'minimum density' of the points increases, the interpolant will converge uniformly pointwise to the 



(2.13) 
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data function / on the data region K. The minimum density of the independent data points is measured 
using the expression 

/ixif = niaxdist(a;,X) = max min x — x'^'^^ . (2.15) 

Here hx,K is the maximum distance between a point in K and a point in X, or equivalently hx,K 
is the radius of the largest open ball centered in K which does not contain any of the data points. Its 
use clearly makes sense intuitively but from a numerical point of view, in dimensions greater than 1, the 
calculation of hx,K presents formidable difficulties. 

Theorem 34 tells us that the representer of the evaluation functional f ^ f {x) is 
= (27r)"'^/^ G (• - a;) e X%. Thus for any data function / e 

f{x)-{Ixf) {x) = if-Xxf,R.)^^o^ 
where l^xf is the minimal interpolant of the data function on the independent data set X. Hence 

1/ (x) - [ixf) {x)\ = |(/ - ixf, i?.L,o| < 11/ - 2:x/IL,o l|i?x|L,o 

= (27r)-'^/^yG(0)ll/-2:x/IL,o, (2.16) 

where G (0) is real and positive. The error information is clearly now restricted to the expression 
11/ - Jx/IL if tiie inequality 

||/-Ix/IL,o<ll/IL,o> (2-17) 

implied by equation 2.5 is used the error information is completely lost. We can retain some of this 
information as follows. Choose any independent data point x^^'> . Then the interpolation property implies 

/ {x) - {Ixf) {x) = (/ - Ixf) {x) -if- Ixf) {x^'^) . (2.18) 

In the results proved below we will use the estimates for \f {x) ~ f {y)\ and \G (x) — G {y)\ derived in 
Section 1.6. Before deriving the first error estimate I will show that there exists a suitable sequence of 
independent data sets X^^^ with hxw^K tending to zero: 

Theorem 57 Suppose K is a bounded, closed, infinite set containing all the independent data sets. Then 
there exists a sequence of independent data sets X^''^ c K such that X^''^ c and hx(k) ^ — > as 

k oo. 

Proof. For each k = 1,2,3,... there exists a finite covering of K by the balls 

|-B ^a^j'''; I . Construct X^^^ by choosing points from K so that one point lies in each ball 

B (^al^^; 1^ . Construct X^*^"*"^^ by first choosing the points X*^*^) and then at least one extra point so that 
js^(fe+i) contains points from each ball B ^a^^^^; j^^- 

Then x € K D X^") impHes x € B (^a^f^ for some j and hence dist {x,X^'''>) < p Thus hxw^K = 
maxdist {x,X'^^'>) < t and lim hx(k) ^ — 0. U 

We start with some simple bounds for the pointwise error of the interpolant: 

Theorem 58 Suppose the weight function w has basis function G. Then the interpolant Ixf of any 
data function f satisfies 



\f{x)- {Ixf) {x)\ < ,J{f - Ixf, /)^,oy^40), x&W', (2.19) 



where ^{f - Ixf, /)^,o < ||/L,o ^nd i?o (0) = (27r)-^ G (0). 
Also 

\Ry {x) - {IxRy) {x)\ < Ro (0) , x,ye R'^. (2.20) 
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Proof. Since Ix is a self-adjoint projection w.r.t. (•, •)^ q 

\f{x)-{Ixf){x)\= {f-Ixf,R.)^,o <\\f-IxfL,oWL,o 



= ^{f-Ixf,f)^,o\/R^- 

But Ry (x) = (27r)"^ G(a: - y) and inequalities 2.12 imply \\f -IxflL^o < ll/IL,o so that 
^(/-Jx/,/)^ o < 11/11^ and i?, (x) = i?o (0) = (27:)"^ G (0). Now letting f = Ry in 2.19 we get 



\Ry {x)-iJx Ry){x)\ < J{Ry-JxRy,Ry)^WRo{0) 



= ^Ry{y)-(.lxRy){y)VRo (0), 
which implies that when x = y, Ry {y) — {IxRy) {y) < Ro (0) and so 

\Ry {x)-{IxRy) (a;)| <i?o(0), 

as required. ■ 
Typo 1 pointwisc estimates 

Type 1 estimates place no a priori restriction on k. In the next theorem a smoothness condition is 
applied to the basis function near the origin and this will allow a uniform order of convergence estimate 
to be obtained for the interpolant in a closed, bounded, infinite data region. The next result is based on 
part 2 of Corollary 36. 

Theorem 59 Suppose the weight function w has property W2 and that G is the basis function generated 
by w. Assume that for some s > and constants Cq, he > the basis function satisfies 



G{0)-ReG{x) < Cg\x 



,2s 



\x\ < ha- 



(2.21) 



Let Tx be the minimal norm interpolant mapping with the independent data set X contained in the 

' _d 

closed, bounded, infinite set K, and let kc = (27r) * \/2Cg- Then for any data function f € it 

and 



follows that J (/ - Ixf, /)^_o ^ 



\\w,0 



\f (x) - Ixf {x)\ < ka^if - Ixf, /)^,o ihx,Kr , xeK, 



(2.22) 



when hx,K = maxdist {x, X) < he i-e- the order of convergence is at least s. 



Proof. Prom Theorem 58, ^{f -Ixf, /)^^o ^ ll/IL,o- fix a; e -fC and let X = C K he 

an independent data set. Using the fact that Ixf interpolates / on X we can apply part 2 of Corollary 
36 to obtain 

1/ (x) - Ixf {x)\ = \{f- Ixf) (x) - (/ - Ixf) 

< {27r)-i ^/2 ||/ - Jx/IL,o G (0) - Re G {x - xO)) 



= {2n)-- V2J{f - Ixf, /)„ VG (0) - ReG (x - xU)), 



for all j, where the last step used the fact that Ix is a self-adjoint projection. Then, noting that 
dist {x,X) < ho, we can apply the upper bound 2.21 and obtain 



1/ ix) - Ixf {x)\ < (277)-' V2^if - Ixf, /)^,o-\/ \x - xU) 



1 2s 



-Ixf, f)yjfi 



X — X 



0) 
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for all j and so 



\fix)-Ixf {x)\ < fcG^(/-2x/,/L,o (dist {x, X)y 

< kcJif -Ixf,f)^o ( max dist {x,X) 



where the last step used inequalities 2.12. Since ^(/ —Ixf, f )^ q < ll/IL o order of convergence 
is at least s. m 

Wc now prove a double error rate for the interpolant of data functions which are Riesz representers 
Rx'. This predicts a convergence order which is at least double that for an arbitrary data function. It 
also predicts a global bound for the error. 

CoroUciry 60 Under the notation and assumptions of the previous Theorem 59 the Riesz representer 
data functions R^' satisfy 

\R,, {x) - {IxRx') {x)\ < [kaf {hx,Kf' , x € K, x' € M.'', 

when hx,K < ho i-e. the order of convergence is at least 2s in hx,K- 
Proof. The key to this result is the term ■^{f, f — '^xf)wfi ™ the estimate 2.22 proved in the last 
theorem. Substituting / = R^' in 2.22 gives 



R^. {x) - {XxRx') {.x)\ < kaJiR^' -IxRx',Rx')^ o ihx,Ky 



= kc^R^' {x') - {IxRx') {x') {Hx^kY , 
so that 

which implies 

\R.' {x) - {IxRx') {x)\ < {kaf {hx,Kf' , 

as required. ■ 
Examples: radial basis functions 

For the shifted thin-plate splines, Gaussian, Sobolev and extended B-splines we will derive the interpolant 
error parameters s, ha and Cg defined above in Theorem 59. To do this I will use the results of Theorem 
40 when k > 1: namely if w satisfies property W2 for k = 1 then 

\G{0)-ReG{x)\<CG\xf , xgR'^, (2.23) 

where 

G {x) = f (r2) imphes Co = ||2r/" + f\\^ , (2.24) 

G{x)=g{r) implies Cg = I yiU- (2.25) 

If K < 1 I will use the mean value theorem. 
Shifted thin-plate splines 
Prom 1.4.2 the basis functions are given by 

G (x) = {-1)^"^ (l + \x\^y , -d/2<v<0. 

The weight function has property W2 for all k > so we can use the estimates 2.24 and 2.25. But G 
depends on \xf so 2.24 is easier algebraically: / (r) = (—1)^^^ (l + r)^, /' (r) = (—1)'^"^ w(l-l-r)^^^, 
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/"(r) = {-l)^''\{v-l){l+ry-^ and /"' (r) = {v - 1) {v - 2) {1 + rY'^ For the func- 



tion \2rf" + f'\ to take a maximum 3/" (r) = —2rf"' (r) must be satisfied i.e. 3v {v — 1) (1 + r)" ^ = 



-2v {v -l){v- 2) (1 + r)" ^ which reduces to 3 (1 + r) = -2 (t; - 2) = 2 (2 - so that 

Cg = |(2r/" + /') (wx)! , «;/iere / (r) = (1 + r)" and w = ^^^i^. (2.26) 



Also he = oc , .s = 1 and ka = {2it) * \/2Cg- The order of convergence for an arbitrary data function 
is at least 1 and that for the Riesz representer is at least 2. 

Gaussian 

The weight function has property W2 for all k > 0. It is easier algebraically to set G (x) = / (r^) where 
/ (r) = e^*" and use 2.24. For the function 2rf" + f' to take a maximum 3/" (r) = —2rf" (r) must be sat- 
isfied i.e. 3e~'' = 2re~'" so that rmax = 3/2 and hence Cq = {2rf" + /') (rmax) = (2re~'" — e"'') (rmax) = 
2e-3/2: 

Cg = 2e-3/2. 



Also /iG = oo, s = 1 and /cg = (27r)~* \/2Cg- 

The order of convergence for an arbitrary data function is at least 1 and that for the Riesz representer 
is at least 2. 

Sobolev splines 

For this case the basis function is 1.15 i.e. 

G{x) = ^^^^K,_^/^{\x\), xeM^ (2.27) 

constrained by u > d/2. The weight function has property W2 for < k < v — d/2. There are two 
cases: v — d/2 is a positive integer and v — d/2 is a positive non-integer. Refer to Sections 3.1, 3.2 and 
3.3 of Magnus et al. [9] for the properties of Ky and Ky given below: the smoothness results have been 
obtained from the infinite series representations. 



Case 1: V — d/2 = 1, 2, 3,. . . Here we are dealing with the modified Bessel functions of integer order. 



Now rKo has properties 

rKo e C(°) ([0, oo)) n C°° (0, oo) ; lim rKo (r) = 0; Kq (r) > 0, r > 0; 

r— >0 

lim rKo (r) = 0. 

r— ^oo 

and numerical experiments lead to the hypothesis: rKo (r) has a single turning (maximum) point: 

< rKo (r) < r' Kg (r') < 0.4665 when < r < r' < argmin pKo (p) ~ 0.595. 

In general, for m = 1, 2, 3, . . ., K^ = r'^K^ {r) satisfies 

K_m = Km; ^™eC(2'"-i)([0,oo))nC°°(0,oo); ^„(0) = 2—1(771-1)!; 

km (r) > 0, r > 0, 

the derivative of Km satisfies 

DKm (r) = -rKm-i (r) ; DK^ (0) = 0; DKm (r) < 0, a; > 0; 
lim DKm (r) = 0, 

r— ^oo 

and if m > 2, the second derivative of Km satisfies 

D^km (r) = -km-i (r) + r^km-2 (r) ; D^Km (0) = -2™-2 (m - 2)! ; (2.28) 

lim D^km (r) = 0. 
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Now we want to estimate [G (0) — Re G {x)\. 

When V — d/2 = 1 we have k <1 and so we cannot use 2.23. However, by the mean value theorem 



Ki (r) - Ki (0) 



< max 

pe[0,r] 



DK, ip) r = max {pKo (p)) r < \\pKo {p)\\^r, 

/9e[0,r] 



so that 



G (0) - Re G {x) = ^;r4>) - ^1 (1^1)) 



< 



2''-ir(w; 



l|pi^o(p)IU|x| 



and consequently 



if V = 1 then s = -, Cq = ., ' \ , ha = oo. 

•'2 2 2''-ir(w) 



(2.29) 



(2.30) 



On the other hand if t; — rf/2 = 2, 3, 4, . . . then k> 1 and we can use the estimate 2.23. Indeed, since 



G{x)=g (r) = 2^-ir(t;) -^"-'^/^ ^""^ ^ 



2"r (v) 



v-d/2 



t;- - =2,3,4,..., 



and thus 



i/ u — — = 2, 3, 4, . . . t/ien s = 1, Gg is given by 2.32, /ig = oo. 



(2.31) 
(2.32) 

(2.33) 



Case 2: V — d/2 > and non-integer Here we are deal with the modified Bessel functions if^ of pos- 



itive non-integer order. We have 

K_^ = K^; i?^eG°°([0,oo)); K^{0) = 2M-'T {\f,\) : 



(r) > 0, r > 0; lim (r) = 0; 

r— ^oo 



Concerning the first derivatives: 



so that 



DK^ (r) = -rK^_i (r) , 



DK^ (0) = 0; DK^ (r) < 0, r > 0; lim DK^ (r) = 0. 

r — ^oo 

Regarding the second derivatives: 

D^k^ (r) = -K^_i (r) + r^K^_2 (r) , 

so that 



D^K^ (0) = (0) = -2i''-2ir (Im - 1| 



and 



lim (r) = 0. 

r— >oo 



(2.34) 



Now we want to estimate G (0) — Re G {x) . 

When < v — d/2 < 1 we have k < 1 and so we cannot use 2.23. However, using a second order Taylor 
series expansion (Appendix Subsection A. 7) 



< — max 
2 te[o,r] 



D^K^ {t) 



K^{r)-K^{0) = K^ir)-K^{0)-rDK^{0) 
so that 

G(0) - ReG(a;) = ^^jz^^ (0) - k,-,/2 (r)) < 



1 

< - 
- 2 



v-d/2 
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and 



Cg = 



0<v-d/2<l. 



(2.35) 



When V — d/2 > 1, i; — | ^ Z+ we have k> 1 and so we can use 2.23 and 2.25 to get 



Cg = 1\\9"\ 



1 



2^ (v) 

and together with 2.35 we can conclude that: 



v-d/2> 1, 



s = 1, Cg is given by 2.35, Hg = oo when v — ^ ^ Z+, v — ^> 0. 



(2.36) 



Examples: tensor product extended B-splines 
With reference to the error estimate of Theorem 59: 

Corollary 61 Suppose the weight function is a tensor product extended B- spline weight function with 

parameters n and, I . Then, if Gi is the univariate basis function, the order of convergence s of the minimal 
norm interpolant to its data function is at least 1/2 and Cg = VdGi (O)'^ ^ and he = oo. 

Proof. Theorem 43 showed that 

\G{x)-G{y)\< VdGi {Of-^ \\DGi \\^ \x - y\ , x,y G R", 

so Theorem 59 gives this result. ■ 

Examples: summary table 

Table 2.1 summarizes the convergence results 2.30, 2.33 and 2.36 for the Sobolev splines as predicted by 
Theorem 59, as well as those for the other Type 1 examples. 





Interpolant error estimates 




Type 1 non 


-unisolvent estimates: k > 0, /cg = {2Tr)~'^^'^ \/2Cg- 






Parameter 


Converg. 






W(nglit function 


constraints 


orck^r .s 


Cg 


ha 


Sobolev splines 


v-\ = \ 


1 

2 


2"-ir(t;) 


oo 


{v > d/2) 




1 


W ' 1 1 oo 


oo 




Shifted thin-plate 

{-d/2 < V < 0) 




1 


eq. (2.26) 


oo 


Gaussian 




1 




oo 


Extended B-spline 




1 
2 


Gi {Of-'WDG^W^VdW 


oo 



TABLE 2.1. 



Another approach - the interpolation error seminorm 

Here we will show how the definition of a seminorm using the interpolation operator can be used to 
obtain the above results concerning interpolation in a very concise manner. In Chapter 4 which studies 
a smoother (a stabilized interpolant) we will also make use of several seminorms to study the smoother 
error. 

Definition 62 Interpolation error seminorm and semi-inner product 

Suppose we selected the semi-inner product (/ — Ixf,g — ^xg)^ q for pointwise estimation of f — Ixf ■ 
But Xx is a self-adjoint projection so (/ — Ixf,g — Ixg)^ o ~ (/ ~ ^xf,g)yj o o-f^d this is more suited 
to pointwise estim,a,tion since 

(/ — Xxf, Rx)yj Q = / (x) — (Xxf) (x) . So for f,gG X'^ let us define the interpolation error seminorm 
and semi-inner product by 



I /I, = (/ - Ixf, /)^,o > (/> 9) I = (/ - Ixf, g) 
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The next theorem shows the close relationship between the pointwise error of the interpolant and the 
interpolation seminorm. 

Theorem 63 If f,g & then some properties of the interpolation seminorm are: 
1. im\\\-\j = WG,x. 

2- I/I? = ll/lllo - UxfWl,^ and {f,g)j = {f,g)^^, - {Ixf,g)^,o- 

3. {f,g)i = {f^g)^^^-(ixfY {Rx,x)-^£xg. 

4. {f,R.)j = f{x)-{Ixf)ix). 

5. \Rx - RxW \i = \Rx\i when x^'''> e X. 

Proof. Using the comments following the definition of the data function and the interpolant map - 
Definition 55 - as a guide the proofs arc straight forward. ■ 

The pointwise error of the interpolant has the following properties: 

Theorem 64 For all x,y € M"^; 

1. \Rx (y) - {IxRx) {yf < {R. {x) - {IxR.) (x)) {Ry (y) - {IxRy) (y)) ■ 

2. Rx{x)-{IxR.){x) = ||i?,-i?,(.)||^o-||Xx(i?.-i?,(.))||^,o> 2^^'=' ex 

Proof. Part 1. This is just the Cauchy-Schwartz inequality \{Rx-iRy) j\ < |-Rx|/ \Ry\i- 
Part 2. From part 2 of Theorem 63 

\Rx - Rxi^Ai = \\Rx- ^2:J|^ 0- 11^^ (-^^ - -'^xJII^ ^"^^ from part 4 of Theorem 63 \Ra:\i = Rx {x)- 
{IxRx){x). ■ 
From these two theorems and the calculation 

= Rx {x) - Rx (xC^)) - Rx (ajC^)) + 

= 2 (27r)-'*/' (g (0) - Re G - x^^)) ) , (2.37) 
the interpolation results of Theorem 59 and Corollary 60 can be obtained without much difficulty. 
Type 2 pointwise estimates 

Type 2 pointwise estimates only assume k > 1. The Type 1 estimates of Theorem 59 considered the 
case where the weight function had property W2 for some k > and an extra condition was applied to 
the basis function. In the next theorem we only assume that k > 1 and derive an order of convergence 
estimate of 1, as well as a doubled convergence estimate of 2 for the Riesz data functions Ry. 

Substituting a = (3 = and y = in part 3 of Corollary 37 gives the Riesz representer-type estimate 

since G (0) is real and |G (a;)| < G (0). Hence G always satisfies 2.21 with s = 1/2 and the theory of 
Type 1 convergence estimates can be always be applied to obtain convergence rates of at least 1/2. Also, 
K> 1 implies G G G'-L^'^J-' C G^^^ and the mean value theorem or the Taylor series expansion can always 
be applied with s = 1/2 or s = 1 to obtain Type 1 convergence estimates. 

However, instead we will start by using part 1 of Corollary 36 - a Riesz representer-type estimate 
for data functions - to obtain some error estimates. Note that the constant kc in the next Theorem is 
defined to match the constant kc for the Type 1 convergence estimate 2.22. 
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Theorem 65 Suppose the weight function w has property W2 for a parameter k> 1. Suppose also that 
Ixf is the minimal interpolant on X of the data function f G and that K is a bounded, closed, 
infinite subset ofM.'^, X c K and hx k = maxdist {x, X). 

Then ^ 

' (2.38) 



1/ (x) - (Ixf) {x)\ < kaJif - Ixf, f)^ahx,K, x€K, 



where 



n_ ^_(2^)-2 (v^G) (0), 



{f-lxfj)^,0 < ||/||.u,.0' 



(2.39) 



and the order of convergence is at least 1. Further, for the 'Riesz' data functions Ry, y £ K, we have 



\Ry (x) - (IxRy) {x)\ < {kaf {hx,Kf , x,y & K, 
i.e. a 'doubled' estimate of at least 2 for the rate of convergence. 
Proof. Let X = {x^i">} Then since Tx is an interpolant on X 

f {x) - {Ixf) {x) = (/ - Ixf) {x) -if- Ixf) (xW) , 
for aU j and applying the inequality of part 1 of Corollary 36 to / — Ixf & X'^ we get 
1/ {x) - {Ixf) {x)\ = \{f- Ixf) {x) -if- Ixf) {x^''>) 

<kG\\f-Ixf\L,0 



(2.40) 



X — x^^^ 



kGJ{f-Ixf,f)^,o 



, j = l,...,N, 



(2.41) 



Now suppose that X C K. Then the inequalities 2.41 imply that 



\fix)-{Ixf) {x)\<kG^{f-Ixf,f)^,o<i^st{x,X), 
and so if x is restricted to K 

\f{x)- {Ixf) {x)\ < kc^Jif- Ixf, /)^,o mi^dist {x, X) = kG^J{f-Ixf,f)^,ohx,K. 
Thus when f = Ry 



\Ry (x) - (IxRy) {x)\ < kc^ (Ry - IxRy, Ry)^^ohx,K 

= kG^ Ry {y) - i^xRy) {y)hx,K, 



(2.42) 



so that when x = y 



Ry (y) - (IxRy) iv) < kGJRy (y) - (IxRy) {y)hx,K, 



and Ry [y) — (IxRy) (y) < {kGhx,K) ■ IncquaHty 2.42 now implies inequality 2.40 as required. 
Finally, that / ^ = - {2-k)'^^'^ (V^G) (0), follows directly from 1.45. ■ 

Examples: radial basis functions 

It was shown in Subsection 1.2.3 that the shifted thin-plate splines and the Gaussian have weight 
functions satisfying property W2 for all k > 1. Thus we can apply Theorem 65 to obtain the same orders 
of convergence that were obtained for the Type 1 results in Subsubsection 2.5.2 i.e. a convergence of 
order 1. 

It was shown in Subsection 1.2.3 that < k < i; — d/2 for the Sobolev splines. Thus we can choose 
K> 1 iff t>-(i/2 > 1. 
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From 2.39, fc^ = - {2?:^^ (V^G) (0). Now if G (x) = / (r^) then by part 2 of Theorem 39, (V^G) (0) = 
2/' (0) d and so 

fcc = (27r)-''/^ V-2/' (0)\/rf, (2.43) 
and if G{x)=g (r) then by part 1 of Theorem 39, (V^G) (0) = g" (0) d and so 

fee = (27r)"'*/^ V-fl" (0)^/^- (2-44) 

Shifted thin-plate splines / (r) = (1 + r)", // (r) = (1 + r)""^ and so fee = (27r)"'^/'' 
Gaussian / (r) = e"'', // (r) = —e~^ and hence fee = {2-k)~'^^^ \/2\fd. 
Sobolev splines Here we use results of Subsubsubsection 2.5.2. From 2.27, 

9 (^) = 2-^v{v) ^v-dli (^) and so g" (0) = j,r^^^D^K^_a/2 (0). 
If u — d/2 is an integer then t; — d/2 = 2, 3, 4, . . . and by 2.28 

„ ^ D'K,.d/2 (0) ^ 2^-'^/^-^(^-rf/2-2)! ^ - d/2 - 2)! 
^ ^ ^ 2''-ir(t;) 2''-ir(v) 2d/2+ir(t;) ' 

and by 2.44 

ka = (2.)-'/' V^^)^d = [2.)-"' (2.45) 

liv- d/2 is not an integer then t; - d/2 > 1 and v - d/2 ^ 2, 3, 4, . . .. By 2.34 

„ ^ D^K,-d/2 (0) ^ 2\-''/^-^\T i\v - d/2 - 1\) ^ 2l— ^/2-2lr(.;-d/2-l) 
9 W 2^-^r{v) 2''-ir(v) 2''-ir(?;) 

r(r-r//2-l) 



2„-l-|^-d/2-2|p(^)' 



and 



-\v-d/2-2\Y (^y^ 



' {2^)-"/' ^ ^.t-Jl%l) ^d, l<v-d/2<2, 

v-d/2> 2, 
d/2 7^ 2, 3, 4 



Finally, 2.45 and 2.46 can be combined to give 
kc = 



(27r)-'^/%/ ,L'!;//i'r?,^ Vrf, l<«-d/2<2, 



(2-r'''^^53^A .-d/2>2. 



Examples: tensor product basis functions 

Extended B-splines {I < n < I) Since the weight function has property W2 for «; iff k + 1/2 < n we 
can choose k = 1 iff n > 2. Then by Theorem 65, kc = {2'k)~'^^^ y/— (V^G) (0) and since 

d d 
(V^G) (0) = J2 (DIG) (0) = ^2^1 (^i) (^2) . . . Gi (a;,)) (0) 

fc=i fc=i 

d 

= ^Gi (0)''-'L»2Gi (0) 
fe=i 

= Gi (0)"^-^ £>2Gi (0) d, 
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it follows that 

These convergence results are summarized in the Table 2.2. 
Examples: summary table 



Interpolant convergence order estimates 
Type 2 non-unisolvent estimates: k> 1 


Weight function 


Parameter 
constraints 


Converg. 
order 




flG 


Sobolev splines 
{v > d/2) 


v-^>2 


1 


lT(v-d/2-l) n 

Y 2''/2+ir(«) 


00 


l<v-^<2 


1 


/ r(«-d/2-i) n 

Y 22''-<i/2-3r(^) V " 


00 


Shifted thin-plate 

{-d/2 <v <0) 




1 


^/-2v\/d 


oo 


Gaussian 




1 


^/2^/d 


oo 


Extended B-spline 
(1 < n < 


n>2 


1 




oo 



TABLE 2.2. 



2.5.3 Pointwise error estimates using unisolvent data sets when k> 1 

Unisolvent sets of independent data points are fundamental to the theory of positive order basis function 
interpolation. In that case the order of the unisolvent set is the order of the basis function. We will now 
show that this technique can be used when the order is zero. Here, however, the order of unisolvency is 
related to the value of the weight function parameter k. 

In Theorem 65 wc assiimed the weight function had property W2 for some k > 1 and so obtained an 
order 1 estimate for the convergence of the interpolant. In this subsection we will show that by assuming 
the data is a unisolvent set of points of order [k\ an order [kJ convergence estimate can be obtained for 
and arbitrary data function. 

Definition 66 Unisolvent sets and minimal unisolvent sets 

Recall that Pm is the set of polynomials of order m i.e. of degree m — 1 when m> 1. 
Then a finite set of distinct points X = {xi} is said to he a unisolvent set with respect to Pm if: 
p G Pm and p{xi) = for all Xi £ X implies p = 0. 

Sometimes we say X is unisolvent of order m or that X is m-unisolvent. 

It is known that any unisolvent set has at least M = dimP^ points, and that any unisolvent set of 
more than M points has a unisolvent subset with M points. Consequently, a unisolvent set with M points 
is called a minimal unisolvent set. 

Theorem 67 Suppose M = dimP^ and A = {ai}fi-^ C M''. 

1. If {Pj}^i is any basis of Pm then A is minimally m-unisolvent iff det {pj (aj)) ^ 0. 

2. In one dimension any set of m distinct points is m-unisolvent. 

Proof. Part 1 follows directly from the definition of unisolvency. Part 2 Choose the monomial basis 
{x'^l^g for Pm. Then the determinant of part 1 is Vandermonde's alternant which is singular iff two 
points coincide. ■ 

Definition 68 Cardinal basis for polynomials Pm- 

A basis {li}f^i for Pm is a cardinal basis for the minimal unisolvent set A = {oi}^^ ifk {aj) = Sij 
and the Ij are polynomials with real valued coefficients. 

It is known that a set is a minimal unisolvent iff there exists a (unique) cardinal basis for the set. 
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The next step is to introduce the Lagrange interpolation operator V and the operator Q = I — V: 

Definition 69 The operators V : C(°) P„^ and Q : C(o) ^ C(°) . 

These operators are only defined for integers m > 1. Suppose the set A = {oij^j is a minimal 
unisolvent set with respect to the polynomials Pm and by Definition 68 there is a unique cardinal basis 
{h}iLi for A. Then for any continuous function f the operators V and Q are defined by 

M 



^/ = E/(°')'- Qf = f-'Pf- 



Theorem 70 The operators V and Q have the following elementary properties: 

1. p € Pm implies Vp = p and hence Qp = 0. 

2. Vf interpolates the data { (aj, / (ai)) Indeed, V is termed the Lagrange polynomial in- 
terpolation function. 

3. = V, VQ = QV = and Q"^ = Q so the operators are projections. 
4- null Q = range P. 

M 

Proof. Part 1 is true since each member of the cardinal basis satisfies Vlj = J2 i^i) ^ — h- Part 2 
is true because li (aj) = 6ij. Regarding part 3, P is a projection since by part 2 

(M \ M 

i=l / |a|<fe i=l 

and so 

VQ^P{i-P) = v-r'^ = o = {i-r)P = Qp, 

which implies = Q{I — P) ~ Q and we have proved part 3. Finally P + Q = I so part 4 holds. ■ 

To study the pointwise order of convergence of the minimal interpolant to its data function we will 
need the following two lemmas. The first lemma provides a pointwise upper bound for the operator Q 
using a multipoint Taylor series expansion which will express Qf {x) = u{x) — Pu [x) in terms of the 
derivatives of order m evaluated on the intervals [a;,ai]. The basic tool is the Taylor series expansion 
with integral remainder. 

Lemma 71 Suppose f G C^"*) (M'') and A = {ai}fi-^ is a minimal unisolvent set of order m > 1. Then 
we have the upper bound 



,i=l 



KzeSA,x 



M 

where Sa,x is the union of closed intervals Sa,x = U [^) j] • 

i=l 

Proof. Prom Definition 69 of P and Q 

M M 

Qf {x) = f{x)- Pf {x) = f{x)-J2f (ai) h {x) = / (a;) - ^ / + (a, - ar)) (x) . 

i=l i=l 

Using the Taylor series expansion formula with integral remainder given in Subsection A. 7 of the 
Appendix, we have for each i 

f{x + {ai-x)) = ^^^^^{ai-xf + {TZmf){x,ai-x), 

\l3\<m ^' 
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where the remainder term satisfies 



\{'J^mf){x,ai- x)\ < — - max \D'^f{t)\ \ai - x\ 

ml |/3|=m ' ^ 'I ' 

i£[x,ai] 



Thus 

M M 



^f{x+{ai-x))k{x)=^[ ^ ^f:^{ai-xf + {n^f){x,ai-x)\k{x) 



i=l i=l \|/3|<m ^ 

= E E -^^{ai-xfh{x) + ^{nmf){x,ai-x)k{x). 

i=l |/3|<m ^' i=l 

But by part 1 of Theorem 70 the operator V preserves polynomials of degree < to. Hence 

i=l |/3|<m ^ |/3|<m 



= E ^{iy-^f)iy = ^) 

\f3\<m ^ 
= f{x), 

leaving us with 

M M 

^f{x + {ai- x)) k {x) = / (x) + E {TlmS) {x, ai - x) k {x) , 

i=l i=l 

and 

M 

Qf (a;) = - E C^rnf) {X, ai - X) k (x) . 

i=l 

Finally, by applying the remainder estimate A. 3 given in the Appendix we get 

\Qf{x)\ < (j2\hix)\^ iZ^\inmf){x,a,-x)\ 

I 



^(ei^^(^)i)# 



max 

\Si\=va 



^»=i ' \zeSA,x 
as required. ■ 

To study the convergence of the minimal interpolant when the data is unisolvent we will also need 
the following lemma which supplies the required results from the theory of Lagrange interpolation. 
These results are stated without proof. This lemma has been created from Lemma 3.2, Lemma 3.5 and 
Theorem 3.6 of Light and Wayne [7]. The results of this lemma do not involve any reference to weight 
or basis functions or to functions in X° , but use the properties of the data region O which contains the 
independent data points X and the order of the unisolvcncy used for the interpolation. Thus we have 
separated the part of the proof that involves basis functions from the part that uses the detailed theory 
of Lagrange interpolation operators. 

Lemma 72 Suppose first that: 



1. il is a bounded, open, connected subset ofW^ having the cone property e.g. %4-3 Adams [IJ. 



2.5 Minimal norm interpolation 
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2. X is a unisolvent subset of of order m. 

Suppose is the cardinal basis of Pm with respect to a minimal unisolvent subset of ft. Using 

Lagrange polynomial interpolation techniques, it can be shown there exists a constant K'^ „ > such that 

M 

J2\^j{x)\<K'a^m, a;eO, (2.47) 

and all minimal unisolvent subsets offl. Now define 

hx,n = sup dist (w, X) , 
wen 

and fix x € X. By using Lagrange interpolation techniques it can be shown there are constants 
co,m) hci^rn > suck that whcn hx,n < h^.m there exists a minimal unisolvent set A C X satisfying 

diam(AU {x}) < c^^„ihx.n- 

Before deriving the first interpolation error estimate f will show that there exists a suitable sequence of 
independent data sets X^^'^ c ^ with hx(k) q tending to zero. The proof is very close to that of Theorem 
57 which considered the case of a closed data region. 

Theorem 73 Suppose fl is a bounded, open set containing all the independent data sets. Then there 
exists a sequence of independent data sets X^'^^ C fl such that X^'^'' C X^''^-^^ and hxw q ^ as 
fc — > oo. 

Proof. For A; = 1, 2, 3, . . . there exists a finite covering of fl by the balls 

j-B ^a^,''''; I . Construct X^^^ by choosing points from O so that one point lies in each ball 

B ^a]^^; ij. Construct X^'''^^^ by first choosing the points X^''^ and then at least one extra point so 
that contains points from each ball B ^a^^^^; ■ 

Then a; e O n X^'^) implies xeB (a[f^ for some j and hence dist (x, X^*^)) < j. Hence hxw^Q = 

sup dist [x,X^'^^) < J and lim hx(k) q = 0. ■ 
xen k^oo ' 

Now we are ready to state our order of convergence result for the minimal norm interpolant for the 

case of a weight function with integer parameter k> 1. 

Theorem 74 Let w be a weight function with property W2 for integer parameter k > 1 and let G be the 
corresponding basis function. Set m = [kJ . 

Suppose Txf is the minimal norm interpolant of the data function f G X^ on the independent data 
set X contained in the data region fl. 

We use the notation and assumptions of Lemma 72 which means assuming that X is m-unisolvent 
and Q is a bounded, open, connected set whose boundary satisfies the cone condition. 

Now set ka = j^^^ (ca,.)'" max {D^^G (0) | . 

Then there exists ha^m > such that for x G fl 



\f{x) {Ix.f){x)\ < kG^{f-Ixf,f)^,,{hx.,nr < kcWflUihx.nr, (2.48) 
when hx,n = sup dist {uj, X) < /la^rn «-e- the order of convergence is at least m. 
Further, we have the upper bound 



\f{x)-{ixf){x)\<k'a\l{f-ixf,f)^^^{dis.u,nr, xen, 

'2 

(2.r^.^km max I 



where k'r^ = ^Z^/^ — :K'a ^ max iD'^'^G (0)1 . The constants cq m, K'a ^ and ha m only depend on O, m 

^ (27r) ' m! ^^>''^ I ^ 'I ' it,rn , ^ ± 



and d. 

In terms of the integrals which define weight property W2 we have 



lax |£)2'3g(0)| < (27r)-'^/' / 



max 
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Proof. The set X is unisolvent so from Definition 66 it must have a minimal unisolvent subset, say 
A = {a, , which we use to define the Lagrangian operators V and Q = I — V of Definition 69. Since / 
is a data function, / (x) — {Ixf) (x) = Q{f — Ixf) (x) and then by estimating |Q (/ — Ixf) {x)\ using 
Lemma 71 we get 



M 



\f{x)-Ixf{x)\< — iY,\h{x)\] max {f - Ixf) {z)\ \ mk\ai - . 

ml \ '-^ I \ \f3\=m ' ' I 8=1 



vi=l 



The next step is to consider the third factor on the last line. Prom part 4 of Theorem 34 



f~Ixf,{-DfR 



wfi 



(2.49) 



<\\f-ixf\L,o i-DfR. 

by part 4 of Theorem 34. Finally, by part 1 of Theorem 34, i?^ = (27r)"''/^ G{—z) so that (f^^iJ^) {z) 
{2Try'^^^D^0G{O) and 

1^^ (/ - Ixf) (z) I < (277)-''/' 11/ - Jx/|L,o \D'^G (0) I . 



Thus 



max \D^{f- Ixf) (z) \ < {27r)-''^' \\f - Ixf\t o I^'^'G (0) | , 



\0\=m 
zESa.x 



and so 



\f{x)-Ixf{x)\< 



|/3|=m ' 



M 



(27r)2 ml 



11/ - Jx/|L,o max l^'^'G (0)1 |/. m^ |a, - a;]™ • 

\p\=m \ — / J=l 



To estimate the last two factors on the right side of the this equation we will need the previous 
Lagrangian Lemma 72. In the notation of this lemma, if hx,n = sup dist {u, X) < /iQ,m then for a 



wen 



given X there exists a minimal unisolvent set A = {ai}f^-^ such that diam(Au{a;}) < cn,Khx,Q and 



M 



T.\h{^)\<K'n,n.- 
Thus \ai - x| < CQ^mhx,n and 

\f{x)-{Ixf){x)\< 



11/ - Jx/IL,o max \D-'0G (0)| K'^^, {cn,.hx,ny 
(27r)2 m! I'5|=™ 

= kG\\f-Ixf\L,oihx,n)"', 
and because ||/ - IxfWt^o = if ^ ^xf, f)^^o obtain 

1/ {x) - {Ixf) ix)\ < ka^if - Ixf, /)^,o ihx,n)"' , xen, 



(2.50) 



which is almost the required inequality. The extension of the last inequality to .x G is an easy 
consequence of the fact that / and Ixf are continuous on M''. The second inequality now follows directly 
from 2.12 and so the order of convergence of the minimal norm interpolant is at least m= [k\. 

" and so D^f^GiO) = (27r)-^ / (gf 



Finally, from 1.31 we have D'^I^G{x) = {2Tr)~^ J 
When |/3| = m 

{2n)-\D-^G{0)\<J 



w(0 



d^< 



m 



w{0 



dt 



Thus for an arbitrary data function the order of convergence of the minimal norm interpolant is at 
least [kJ . Using the same technique as Corollary 60 the following double rate of convergence estimate 
can be obtained: 
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Corollary 75 Under the notation and assumptions of the previous Theorem 74 the data functions Rx' 
satisfy 

\R.'ix)-{IxRx'){x)\<{kGf{hx,nf^^^, x,x'eK, (2.51) 
when hx,K < he i-e. the order of convergence is at least 2 [kJ. 

Exaiiiijlcs: radial basis riiiicl ioii iuli^i'ijolant 

In Subsection 1.2.3 it was shown that the weight functions of the shifted thin-plate splines and the 
Gaussian satisfy property W2 for all k > so [k\ G Z+. 

For the Sobolev splines the weight functions satisfy property W2 when < k < v — d/2. Thus 
[kJ = V — d/2 — 1 when v — d/2 e and v — d/2 > 2, and [/tj = [v — d/2\ when v — d/2 ^ Z+ and 
V — d/2 > 1. Compare these with the estimates of Subsubsection 2.5.2 as summarized in Table 2.1. 

Examples: tensor product extended B-spline interpolant 

Corollary 76 Suppose the weight function is an extended B-spline weight function with parameters 

2 < n < I. Then the order of convergence of the minimal norm interpolant to an arbitrary data function 
is at least n — 1. Further, the order of convergence to a Riesz data function Ry is at least 2n — 2. 

Proof. Theorem 7 implies that [kJ = n — 1 and Theorem 74 implies that the order of convergence is 
at least n — 1. Corollary 75 now shows the order of convergence to a Riesz data function Ry is at least 
2 [kJ = 2n - 2. ■ 

Exaiiii)lcs: suiiiiiiary tabic 



Interpolant error estimates 




Unisolvent data: k > 1 






Parameter 


Convergence 


Weight function 


constraints 


orders ([kJ) 


Sobolev sphnes 


t; - f = 2,3,4, .. . 


[v - d/2\ - 1 


{v > d/2) 




[v - d/2\ 


Shifted thin-plate spline 




2,3,4,... 


Gaussian 




2,3,4,... 


Extended B-spline 

(1 < " < /) 


n > 2 


n — 1 



TABLE 2.3. 



2.6 Data functions and numerical results for the non-unisolvent case 

In this section we will only be interested in the convergence of the interpolant to it's data function and 
not in the algorithm's performance as an interpolant. We will only consider the numerical experiments 
regarding the convergence of the interpolants generated by dilations of the one-dimensional extended 
B-spline basis functions 1.34 with parameters n and I. We will also restrict ourselves to one dimension 
so that the data density parameter can be easily calculated. 

We will divide our numerical experiments into those with n = 1 and those with n — 2. For the case 
n = 1 we will consider the two cases n = I = \ and n = 1 , / = 2 so that derivatives are involved, and for 
n = 2 we will use n = 2, Z = 2. The data region will be the interval [—1.5, 1.5]. 

Becaiise all the scaled extended B-splinc basis weight functions have a power of sin x in the denominator 
we will need to derive special classes of data functions which are convenient for numerical calculations. 
This will include characterizing the data functions locally as Sobolev-like spaces which makes it easy to 
choose data functions for numerical experiments. 

See also the discussion in Chapter 3 regarding the data functions generated by the central difference 
weight functions. There it is shown that locally the sets of data functions for the extended B-splines and 
central difference weight functions are identical. 
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2. 6. 1 Extended B-splines with n = 1 

The case n ~ 1, I = 1 

The hat weight function wa is defined by 1.19 and was discussed in Subsections 1.2.1 and 1.2.6. This 
is a scaled, extended B-splines basis function with parameters n = I = 1 and Theorem 29 implies 

max [k\ = 0. 

Now suppose n is the multivariate tensor product rectangular function defined using the 1-dimensional 
function 11 (f) = 1 when \t\ < 1/2 and 11 (t) = when \t\ > 1/2. We will now justiiy using data functions 

of the form u*n where u & (R'^) . Because fl {t) = (27r)-'/' and A {t) = (27r)-'/' (~[/^^ 

it follows that 

(n(0)' = (27r)-'^/'A(0, eeR''. 

Since H G L^, Young's inequality 

ll/*5ll.< ll/IUMI,, f€LP,g€L^, 1 + 1 = 1 + 1 l<p,g,r<oo, 

implies u*Il e (M.^) and u*Il = (27r)"'^/^ Ju{y)U{-- y) dy. Further u*n e implies u*n e L]^^ 
and if = X weight function 



llu*ni 



u n 



1/2 



12 ' 



so that u * n e X^^. Now u G implies u e Lj^q^ and we can define V G Lj^^ 

V{x)= j u {t) dt, ueL^ (M.'^) , 
Jo 

over the volume of the open rectangle TZ{0,x) so that 



by the integral 

(2.52) 



x+l/2 



u * 



n = (27r) ^ j u{y)U{x-y)dy = {2n) ^ j u{y) 



dy 



x-l/2 



Vlx+- 



and we note that any translate is also a data function. 

Observe that by the Cauchy-Schwartz inequality \V {x) — V {y)\ = u{t)dt < \\u\\2\x — yf^"^ so V 

is continuous and since u G {TZ (0, x)) equation 2.52 implies that D^V = u where 1 = (1, 1, ... , 1). 
To obtain our 1-dimensional data function fd we will choose 



u = e 



for which 
and 



V= (27r)5erf, 

/d = u * n = erf ( a; + 



2 = 2(27r) 
1 



ll^*n||^^_o = 2, 



erf 



(2.53) 



(2.54) 



For the double rate convergence experiment we will use A = V^ttRq as the data function. By part 2 
of Theorem 6, k < 1/2 so we must use the Type 1 error estimates of Theorems 59 and 58. These imply 
that if 

\x\ < ho, 



G{0)-ReG{x) < Cclxf" 
then there exists some ha > such that 



\fAx)-(Ixfd)ix)\<\\U\ 



w,0 ' 



■{kG{hx,KY ,VRo{0)Y xGK, 



(2.55) 
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when hx.K = maxdist (s, X) < ha- Here fee = (27r) ^ V2Cg, i?o (0) = (27r) ^ G (0) and X is an 

independent data set contained in the closed, bounded, infinite data region K. From Corollary 60 we 
have the corresponding double order convergence estimate 

\Roix)-{JxRo){x)\<mm{ikGf{hx,Kf\Ro{0)}, xeR\ (2.56) 

Since we are using the hat basis function Corollary 61 gives: G (0) = 1, Cq = 1, s = 1/2, ho — oo, 
and from 2.53 and 2.54, ||/d|L,o = 2- 

Numerical results 

Using the functions and parameters discussed in the last subsection the four subplots displayed in Figure 
2.1 each display the superposition of 20 interpolants. 



Data fn: erf(x+1/2) - erf(x-1/2) Filtered error 




-4 -2 2 _4 _2 2 

Log1 data density Logi data density 



Data fn: R^(x) Filtered error 




-4 -2 2 _4 _2 2 

Log1 data density Logi data density 

dim 1 , N=1 , L=1 , spl scale 1/2, sm parm 0, samp 20, pts 1 :3600, 
max ill-cond err -3.9:-5 , ill-cond pts 1148:2460 2460:3600 



FIGURE 2.1. Interpolant convergence: extended B-spline. 



The two upper subplots relate to the data function 2.54 and the lower subplots relate to the data 
function Rq i.e. the Riesz representer 1.37. The right-hand subplots are filtered versions of the actual 
unstable interpolant. The data function is given at the top of the left-hand plots and the annotation at 
the bottom of the figure supplies the following additional information: 



Input parameters 



N = L = 1 - the hat function is a member of the family of scaled extended B-splines with the indicated 

arameter values. j * w ■ i 

/ , , r , /I , \ , / , , Student Version of MA 

spl scale 1/2 - changes basis function scale (dilation): x = x/ spl scale. 
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sm parm - the smoothing parameter is always zero for interpolation. 

samp 20 - the sample size i.e. the number of test data files generated. The data function is evaluated 

on the interval [-1.5,1.5] using a imiform (statistical) distribution. 

pts 2:3600 - specifies the smallest number of data points 2 and the largest number of data points 
3600. The other values are given in exponential steps with a multiplier of approximately 1.2. 

Output parameters/messages 

max ill-cond err -3.5:-4.2 - this relates the ill-conditioning of the interpolation matrix to the 

interpolation error. A number or aster * preceding the colon refers to the data function fd and a number 
or aster after the colon refers to the data function Rq. An aster * will mean there were no ill-conditioned 
interpolation matrices generated by the data function. Here -3.5 means that the largest (unfiltered) 
interpolation error for which a matrix was ill-conditioned was 10^^'^. 

ill-cond pts 1234:3600 1499:2963 - the first colon-separated group corresponds to the data function 
fd and indicates the smallest and largest numbers of data points for which the interpolant matrix was 
ill-conditioned. Asters indicate no ill-conditioning. The second colon-separated group refers to the data 
function Rq. 

Note that all the plots shown in this document have the same format and annotations. 

As mentioned above the interpolants are filtered. The filter calculates the value below which 90% of 
the interpolant errors lie. The filter is designed to remove 'large', isolated spikes which dominate the 
actual errors. The interpolation error is calculated on a grid with 300 cells applied to the domain of 
the data function. No filter is used for the first five interpolants because there is no instability for small 
numbers of data points. 

As the number of points increases the numerical smoother of Rq is observed to simplify to three very 
large increasingly narrow spikes at ±1.5 and 0, and these dominate by about three orders of magnitude 
a residual stable error function of uniform amplitude and zero trend. 

The smoother of fd consists of intermingled spikes of various heights superimposed on a trend curve of 
amplitude comparable to the average spike size. The maximum spike height is at most about one order 
of magnitude of the average spike height. At the boundary, where the trend is near zero, there are often 
two spikes which are narrow w.r.t. the interior spikes. 

The (blue) kinked line at the top of each subplot in Figure 2.1 is the theoretical upper bound for 
the error given by inequalities 2.55 or 2.56. Clearly for the data function fd the theoretical boimd of 
1/2 underestimates the convergence rate by a factor of about four - assuming the filtering is valid. The 
situation for the data function Rq is more complex. The double theoretical convergence rate of 2s = 1 
might suggest that the actual interpolant converges twice as quickly for the Riesz data function and the 
filtering reinforces this suspicion. However, there is no nice linear decrease observed and the error of each 
interpolant decreases in large steps to the stable state. 

It is also very interesting to note that the theoretical upper error bound for the data function Rq is 
able to take into account quite closely the instability of the interpolant. 

The case n = 1, / = 2: 

Amongst other things, the following result will equip us with some tools to generate data functions for 
the extended B-splines data spaces, data functions for which the X'^ norm can be calculated This result 
is closely related to the calculations done above for the hat function. 

In the sequel translations will be defined by Tcf (x) ~ f{x — c), c € M'*, and dilations given by 
axf (x) = / (x/A) where A G and X > 0. U x,y E then x.y = (xiUi) denotes the component-wise 
product of X and y whereas the scalar or inner product is denoted by xy or {x,y). 

Theorem 77 Central difference operators and data functions for B-splines 

Suppose for c e R"^ we define the central difference operator d" by = S^^^^S"^^'^ . . . 6^^^'^ where 



and use the abbreviation = 




(2.57) 




(2.58) 
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Next, suppose w is the extended B-spline weight function with parameters n and I given by 1.20, and 
that in the sense of distributions D°'U e L'^ (K'*) when a< n. 
Then, regarding data functions, if fa = 62'U then fd S X° and 



ll/<ilL,o = 2"||^"'t^|l2- 

Further: 

1. Translations are isometric isomorphisms from X° to X° . 

2. If m > is an integer then the dilation am is a continuous mapping from X'^ to X^. 



(2.59) 



3. The embedding X° 
max ||-D"w||^. 

|a|<n-l" "°° 



c 



{n-1) 



is continuous when C 



(n-l) 



is endowed with the supremum norm 



4- Suppose V € eg" (W^) . Ifv has periods 1 then vSi^U = 62' {vU) e X^ for all l.Ifv has periods 21 

then vSi^U = S^^ (vU) G X^l when I is even. 

Proof. Equation 2.58 can be easily proved in one dimension and then for arbitrary dimension using 
the definitions of (^) and P < a. Since sfu = e'^^U - e-'^^U = (2isin^fe)C/ it follows that U = 
(2isin^fe)''^ U. Now using the Cauchy-Schwartz theorem it is easy to show that any function is Lj^^. 

= ||?7||2 and so [/ e Lj^^ and fa G L\^^. Again by 



Further, Plancherel's theorem states that 



U 



Plancherel's theorem D^^U e implies llf't^l 



U 



. Consequently 



\w,0 



j VU 


fd 


2 r ^2nl 


{2ism^kf^U 


J (sin^fe)^'^ 









(2.60) 



= 2 



2ld 



I 



2nl 



U 



= 2^''^ ||D"^f7| 



and fd € X^ which establishes 2.59. 

Part 1 The stated properties of translations Tc follow directly from the facts that TcT-c = t-cTc = 1 
and |fcw| = 

Part 2 Regarding the dilations, by using two changes of variable we obtain 



(^mfd\\l,fl = J w{t) (Tmfd {t)dt = m'^^ J w{t) fd{mt) 



m 



t 

w I — 



m 

t \2nl 



fdit) 



dt 



dt 



fdit) 



1 



^(2n-l)d 
1 



i2nl 



(sin 
(sin tk) 



211 



211 



dt 

fd{t) 

^2n: 



dt 



211 



fd{t) 



dt 



1 f / sin t \ 



dt 



<^(2(;-„)+i)d||y^| 



t«,0 ' 



smce 



sin Sfc 
sin — 



< m for all Sk G IR^ and all integers m > 1. This proves continuity. 



(2.61) 



Part 3 From part 6 Theorem 34 the embedding X^ ^ Cg is continuous and 1.21 allows us to choose 

K — n ~ I. 
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Part 4 Firstly, U G implies vU G . Further, since L'^ C S' we have by virtue of Leibniz' formula 
for tempered distributions S' 

I3<a 

Now e foj. a<n, so that £»" {vU) G when a < n. Since /d € C 5" C V, if has 

periods 1 then in the sense of distributions, in one dimension 2.58 implies 

vd'^-U = V ^ (-1)^' (J.) [/(• + (/- 2j) eu) = (-1)' ( •) i^U) (• + (/- 2j) e^) = S^T" {vU) . 



Consequently vfd = vd'^^U = 5^2 i'^^) ^^'^ thus 2.59 of this theorem implies vfd G Finally, it is 
now clear that if v has periods 21 and I is even then vfd = {vU). ■ 
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1. Norms of data functions Further to the comment preceding the last theorem, if D°'U € L'^ (M'') 
for a <n then fd = 62'U is a data function with norm 2'** ||Z)"-^f7||2 and translating this function 
does not change the norm. In addition, if v G (R'') has periods 1 when I is odd or v has periods 
21 when I is even, then vfd = {vU) is also a data function with norm 2*'' ||£)"-^ (uJ/)!!^. 



2. Regarding part 4 of the last theorem The condition v G (R") can be weakened to v G 

j^n,oo ^^d^ ^ {f gL°° : D'^f gL°°, < |a| < n} is the L°° Sobolev space introduced below in 
Definition 80. This can be proved using the Leibniz formula D (vf) = vDf + {Dv) f, v G 
/ e which can in turn be demonstrated by adapting the proof of Theorem 14-2, Section I4.I4 
Petersen [11]. 

Our basis function is the (unsealed) 1-dimensional extended B-spline G\ with parameters n = 1 and 
I = 2 given by 1.34 i.e. 

Gi it) = (-1)'-" ((*A)')) (I) = -f {D' (A * A)) (I) 

for tGR^. But 

£)2 (A * A) = A * D^A = A* {S {■ + 1) - 26 + S {■ - 1)) 

^ (A(- + 1)-2A + A(--1)), (2.62) 



so that 



and hence 



Gi(i) = -^(A(| + 1)-2A(|)+A(|-1)), 



DG, it) = (A' (I + 1) - 2A' (I) + A' (I - 1)) , 

i.e. ll-DGilloo = ^v^. Now by Theorem 43 

Gi(0)-Gi(0< pGilUI^I, xGR\ 

which means that 

Gi(0) = ^, Cg = \\DG^\\^ = ^V2^, s = l, ha = 00. 

With reference to the last theorem we will choose the bell-shaped data function 

fd = 6iUGX^, (2.63) 
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where 



so that 



U{x) = 



-fel,2X 



-fel,2X 



^1 (e-'^i.^^') (0) 2(l-e-4'=i.2)' 



0.3, 



(2.64) 



The interpolation error estimates given by 2.55 and 2.56 are now 

\fdix) - (Ixfd) {x)\ < ||/,|L_omin{fcG {hx,Kr , /MO)} 

and 



xG K, 



'0 (x) - i^xRo) < min |(fcG)^ {hx,K 



X e 



respectively, where kc = (27r) * y/2C^ and i?o (0) = (27r) = d (0). 

For the theory developed in the previous sections it was convenient to use the simple, unsealed weight 
function definition 1.20. However, I have observed that scaling can significantly improve the performance 
of the basis function interpolant and so I will present the following theorem for the scaled, extended 
B-splines. 

Theorem 79 Suppose G (x) = Gi (xk) where Gi = (—1) ~" £)2('-") ( (^^/^J \ and A is the univariate 

k=l ^ ' 

hat function and n, I are integers such that 1 < n < I. 

For given X > 0, G (Ax) is called a scaled extended B-spline basis function. The corresponding weight 

function is w\ (t) = {2Xa)^w (^) where a 



1/2 



220-V)+i o-i^d w is the extended B-spline weight function 
1.20 with parameters n,l. Indeed, w\ has property W2 for k iff w has property W2 for k. Further 



G(0) -G{Xx) < 2Xa-'^VdGi {of~^ \\DGi\\^ \x\, xG 



(2.65) 



Finally, if fd e and g^ {x) — fd (2 Ax), it follows that gd G X-^ and \\gd\ 



,d/2 



Wfd 



Proof. From 1.34, G {x) = a '^G (2x), where G is the extended B-spline basis function with parameters 
n and I. Hence by Theorem 23 the corresponding weight function is w\ {t) = {2Xa)'^ w (^) with property 
W2 for K. By Theorem 43 



X e 



G{0)-G{x) < VdGi (0)'^-'||£)Gi||^|a 

and because G (Ax) ~ a^'^G (2Ax) we have 

G (0) - G (Ax) = a-"^ (G (0) - G (2Ax)) < a-'^VdGi {of~^ \\DGi\\^ |2Ax| 

= 2Xa-^VdGi (0)''-i||Z)Gi||^|x|. 

Finally, ga (t) = (2A)-'^ fd (^) and so 



(2A)-"/.(^ 



dt = a'' jw (s) \fd {si\ ds = a-" \\fd\\l^^ . 



Numerical results 

Using the functions and parameters discussed in the last subsection the four subplots displayed in Figure 
2.2 each display the superposition of 20 interpolants. The two upper subplots relate to the data function 
2.63 and the lower subplot relates to the data function Rq i.e. the Riesz representer 1.37. The right-hand 
subplots are filtered versions of the actual unstable interpolant. The four subplots of Figure 2.2 each 
display the superposition of 20 interpolants. 

As the number of points increases the numerical interpolant of fd gradually simplifies to two spikes 
at the end points of the data interval with stable errors in between. The interpolant of Rq is observed 
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Data f n : S^iae'*"") Filtered error 




-4 -2 2 -4 -2 2 

Log 1 data density Log 1 data density 




-4 -2 2 -4 -2 2 

Log 1 data density Log 1 data density 

dim 1 , N=1 , L=2, spl scale 1 , sm parm 0, samp 20, pts 1 :3600, 
max ill-cond err *:-2.9 , ill-cond pts *:* 2976:3600 

FIGURE 2.2. Interpolant convergence: extended B-spline. 



to simplify to a large increasingly narrow spike at zero and this dominates a stable error function with 
absolute value of the order of 10~^. 

The (blue) kinked line above each interpolant at the top of each subplot in Figure 2.2 is the estimated 
upper bound for the error given by the inequalities 2.55 or 2.56. Clearly for the data function fd the 
theoretical bound of 1/2 underestimates the convergence rate by a factor of approximately four - assuming 
the filtering is valid. The situation for the data function Rq is more complex. The double theoretical 
convergence rate of 2s = 1 might suggest that the actual interpolant converges twicstOS^VV^hfofPSfUllATLAB 
Riesz data function and the filtering reinforces this suspicion. However, there is no nice linear decrease 
observed and the error of each interpolant decreases in a large step and then smaller steps to the stable 
state. 

It is also very interesting to note that the theoretical upper error bound for the data function Rq is 
able to take into account quite closely the instability of the interpolant. 



2.6.2 Extended B-splines with n = 2 

Since n > 2, Table 2.2 tells us we can use the Type 2 error estimates of Theorem 65 and the estimate of 
Theorem 58. 



The case: n — 2, I — 2 

From Corollary 79 the function 



^ , ^ (A* A) (2a;) 'W^tt ^, , , 
(A.A)(0) =^(A*A)(2x), 



2.6 Data functions and numerical results for the non-unisolvent case 
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is a scaled extended B-spline basis function with parameters n = 2, I = 2 such that supp 62,2 = [—1)1] 
and G2,2 (0) = 1. Prom 1.34 the scaUng factor is A = 4. To calculate G2,2 we use the convenient formula 



G2,2 {x) = (1 + xf A {2x + 1) + (1 - 2x'^) A (2x) + (1 - xf A {2x - 1) , 



(2.66) 



derived by observing that we require the symmetric equivalents of the B-splines 6*^ studied, for example, 
in Chapter 3 of Hollig [5]: Noting that A (x) = {x + 1), the symmetric equivalents of the convolution 
formula of box 3.11 and of the formulas of boxes 3.3, 3.4 can be used to derive 2.66. With reference to 

Theorem 77 choose the data function 2.63, 2.64 and for double rate convergence experiments we will use 

_ 1 

Rq = (27r) ^ G2.2 as the data function. 

Since n > 2, Table 2.2 tells us we can use the Type 2 error estimates of Theorem 65 and the estimate 
of Theorem 58. These imply that 



and 



\fd (x) - {Ixfd) ix)\ < ||/d|L,o min {kGhx,K, VRq (0)} , x&K,U& X^, 
\Ro{x)-{IxRo){x)\ < min [{kef {hx,Kf ,Ro{0)} , xgR\ 



(2.67) 
(2.68) 



where hx k = max dist (s, X) and in addition 



ka = (27r)-J ^J-G2,2 {of~' D^G2,2 {0)Vd = (27r)-^ ,J -D^G2,2 (0), 
i?o (0) = (27r)-5 G2,2 (0) = (27r)-5 , 
&nd.K= [-1.5,1.5]. 

Using 2.62 we have -D'^G2,2 (0) = -6\/2^D2 (A * A) (0) = 6\/2^^ = 12 so that kc = (27r)"^ Vl2. 
It remains to calculate g- from 2.63 and Plancherel's theorem 



ll/rflL,o = 2" ||^"^t/||2 = 4 ||i?^C/||2 = 4 



By 2.64, U {x) = so U {£,) = ^_ 



W^^=e '=1.2 and 



2 2(l-e-4fci,2)^ 
2 



*=1,2 



1/2 



(1 



-4fei 



ki,2 4/ ki^2 



1/2 



(l_e-4'=i,2)^/fc^ 2 



fci,2 4/A:i,2 f S^y^^ 



2(l-e-4fei,2)^/fcj^ 



= fcl,2A/fcl,2 ( — 



1/4 



3/4 



^ (fcl,2) 

2 l-e-4fei,2' 



and hence ||/dlL,o = 2v 

a"2 



4/9^ (fcl,2)^^^ _ 4/7^ 

• JFTT — V '^TT 



(fci,2) 



3/4 



l_e-*'=i,2 



To summarize: 



ka = 



1/4 



; II/. 



dWw.O 



= \/72n- 



3/4 



-4fei ■ 



fci,2 = 0.3; ho = 00. 



Numerical results 

Using the functions and parameters derived above the 4 subplots of Figure 2.3 were generated. Each plot 
displays the superposition of 20 interpolants with unfiltered output on the left and filtered interpolants on 
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Data f n : S^iae'^""') Fi Itered error 




-4-2 2 -4-2 2 

Log1 data density Log1 data density 



Data fn: R|j(x) Filtered error 




-4-2 2 -4-2 2 



Log1 data density Log1 data density 

dim 1, N=2, L=2, spl scale 1/4, sm parm 0, samp 20, pts 1:1500, 
max ill-cond err -3:-4.7 , ill-cond pts 230:1500 404:1500 

FIGURE 2.3. Interpolant convergence: extended B-spline. 



the right. Above the interpolants is a (blue) kinked hne whose slope indicates the theoretically predicted 
rate of convergence given by inequalities 2.67 and 2.68. There is also an adjacent line which represents 

an estimate of the slope implied by the filtered interpolant. For fd — ^2-^ — ,'4^^^ ^-^ the estimated rate 

of convergence is n — 1 = 1 and for Rq the estimated convergence rate is 27?. — 2 = 2. 

As the number of points increases both numerical interpolants were observed to consist of spikes of 
various sizes but no dominant spikes. 

The (blue) kinked line above each interpolant at the top of each subplot in Figure 2.3 is thStthdentVais^on of MATLAB 
upper bound for the error given by inequalities 2.67 or 2.68. Clearly for the data function fd the theoretical 
bound of 1 underestimates the convergence rate by a factor of about 3 - assuming the filtering is valid. 
For for the data function Rq a theoretical bound of 2 underestimates a convergence rate of about 5. The 
double theoretical convergence rate of 2 might suggest that the actual interpolant converges twice as 
quickly for the Riesz data function and the filtering reinforces this suspicion. 



2.7 The restriction spaces {Q) 

Suppose C M'' is bounded and it; is a scaled, extended B-spline weight function. In this section we 
will use the central difference operators 2.58 to show that the local restriction spaces (il) contain the 
same set of functions as a special, local, Sobolev-like space, denoted (fl). We will then extend the 
class of weight functions to which these characterization results are applicable. This is useful numerically 
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because it is easy to generate functions in iJ™ (fi). However, calculating the norm of an extension to X° 
may be difficult. 

Wc start with some definitions and in these definitions we use the notation a < m where m is a 
non-negative integer and a € R'^ is a multi-index. This notation means: 

{a : a < m} = {a : ak < m for k = I, . . . , m} . 

Sec Definition 221 of the Appendix. 

Our next important data function result Theorem 90 will require some Sobolev space theory based 
on the comprehensive study [1] by Adams. 

Definition 80 Sobolev spaces H"''°° {Q) , H"''°°, H"" {Q) , if™, m = 0, 1, . . . 
For any open set O c M** and in the sense of distributions: 

(fl) = {u&L°° (n) : D^'u e L°° (f2) for \a\ < m} , 

with respective norms \\u\\^^ ^ = niax sup \D°'u{x)\ and \\u\\ = \\u\\^^^^. 

H"" (n) = {ueL^ (n) D°'u G {Vl) for \a\ < m} , 

1/2 

with norms ||u|L_2,n = E (in and ||u||„ 2 = II^^IL,2,m<*- 

\a\<m ^ ' 

We will also need the following Sobolcv-likc Hilbert spaces: 
Definition 81 The spaces H'J' (SI) , H^, (Q) , H^, m = 0, 1, 2. . . . 

(Q) = {uGL^ (Q) : L>"w G (n) for a < m} , 

(fl) = {u e -ff" (ri) : suppw is bounded} , 

1 /2 

with norms |||u||L_2,n = E (in \D"un and |||w||L_2 = lll"IIL,2,M<i- 

Remark 82 The work by Adams [1] is devoted to Sobolev space theory and a review of this study shows 
that many of it's results can be extended to the spaces H™ , which are clearly Sobolev spaces in the case 
of one dimension. In particular: 

1. Theorem 3.2 proves (fi) is a Banach space. 

2. Theorem 3.5 shows that iJ™ (fi) is a separable, reflexive, uniformly convex Hilbert space with 

inner products defined in terms of their norms. 

3. Lemma 3.15 holds. 

4. Theorem 3.18 proves using Lemma 3.15 that if has the segment property (^4.2) then the space 
of restrictions {^\n '■ <!> € Cg"} is dense in (fi). 

5. Theorem 4-32 (The Calderdn extension operator) If SI has a strengthened uniform cone 
property (^4-4) then using Theorem 3.18 it can be shown that there is a continuous linear extension 
£n ■ - termed an m-simple extension - which has a continuity constant dependent 
only on m and d. Note that when the boundary is bounded the strengthened cone property reduces 
to the uniform cone property. 

6. Theorem 4-26 holds. Using a reflection technique a strong m-extension operator is constructed 
(k-simple for k < m) from (fi) to H™ , where il is a half-space in or an open set having the 
uniform C™ -regularity property (^4-6) and bounded boundary. 
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7. Theorem 4-28 holds. Using a reflection technique a total extension operator is constructed i.e. 
the operator is a strong m-extension operator for all m> 1. 

8. The above extension operators commute with translations and dilations in the sense that TcSq = 
^Tc(,n)Tc and axSn = E^^^p-^ax. 

I next present further results concerning the iJ™ (O) spaces. Observe that the proof of part 4 uses 
both and Fourier transform theory and introduces an equivalent Fourier transform norm for H"^. 

Lemma 83 The space also has the following properties: 

1. The translation operator Tc is an isometric isomorphism from (il) to H™ (O — c). 
The dilation operator a\ is a homeomorphism from H™ (fi) to (0/A). 

2. If we define ( fl) = {(f> E : supp(/) C il} then multiplication by a (il) function is a 
continuous mapping from H'^ (Jl) to (fl). In fact there exists a constant Cm,d, depending only 
on m and the dimension d, such that 

\\\M\L,2,n < W \mm,oo lll«IIL,2,n , <^ e Co°° (O) , n e (Q) . (2.69) 

3. Extension Suppose fl permits an extension operator £q described in parts 5 to 7 of Remark 82. 
Then given e > there exists a continuous linear extension mapping Eq-^ : (f2) — > il™ (fi^), 
and which has a continuity constant depending only on e, m and d. 

This extension commutes with translations and dilations in the sense that TcEq,;s = i'To(n);£Tc md 

^. Embedding Suppose has the properties specified in part 3. Then C C^~^^ and (fl) C 
C^^^^ (fl) and these inclusions are continuous under the corresponding H^~^'°° supremum norms. 

Proof. Part 1 ff™ (fi) is closed under translations and dilations, the inverses of translations and dilations 
are translations and dilations respectively, and the continuity calculations are simple changes of variable. 

Part 2 Use the Leibniz theorem for distributions to expand (</)it) for each 1 < a < m to obtain: 
ll^*" II2 < II^IL.oo lll"IIL,2,0' where c'„_^ depends only on m and d. 

Part 3 Using Lemma 17 we construct a function 0n;£ G (fig) such that < 0n;£ < 1, (pn-e = 1 on 
and supp^Q;^ C 03^/4 c Q-e- This function satisfies ax(t)n;s = <Pax{ny,s/\ and Tc(f)n;e = 4>Tc(n);e- Now 
define 

Ea-eU = (t>n;e£nu. (2.70) 

An application of part 2 then proves continuity. Since the extension operator commutes with transla- 
tions and dilations we have 

and 

as required. 

Part 4 Abbreviate the extension operator to £ = £n- By part 5 Remark 82, if u e i?™ (Cl) then 
£u e -ff™. But the norm ||H||„ 2 equivalent to the Fourier transform norm 



/ ri {i+ekr\um'di, 
J fe=i 



and so for a < m 

r r d. _™ d rn_ 

d^ 



f c£u = [ in n {i+^iy^ n {^+e,y suio 

J J k=l k=l 

< [ n (i+^D"' ri (i+^D^ ^(0 

J k=l fe=l 
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An application of the Cauchy-Schwartz inequality now proves the inequalities J 
Thus £u e and the identity A.l: |^| 



2(m-l) 



< oo for a < 



part 2 Lemma 13 we have £u € C 



(m-l) 



E implies l^p"' e L^. Hence by 

|a|— m— 1 



and so u e C 



(m-l) 
B 



We will need some more properties of the operator 62^^ which was introduced in Theorem 77. 

Lemma 84 Some properties of . Suppose w is the extended B-spline weight function with parameters 
n and I. Then: 

1. Sf^ : H" —^ is continuous. 

2. Iffe and supp/ C (-l,!)"^ then h^d^f^f = f + A2if, where suppA2if C ([-1,1]'^^ and 

h,d = {-if 

3. fc/.d^l'"^ • ((—1) I)'') continuous extension mapping. 
Proof. Part 1 If / S then §V = (2isin^fc)^'^ / and 

^'Ifll^ = / 

•' \\w,o I I • i- \2;i 
J (sm^fc) 



{2i sin^fe)^'^/ 



< 2 



Aid 



2 

n,2 • 



Part 2 Prom 2.58 




Now supp/ C (—1,1) implies suppr2/3-2a/ C ([—1,1] j when (3 ^ Zl, and so supp>l2i/ C 

([-1,1]')'- 

Part 3 Follows directly from part 2. ■ 

The next result allows us to use the local space i?" (O) as a source of data functions when the 

interpolation data is contained in a bounded open set fJ. This substantially simplifies the generation 
of data functions in comparison to the approach used in Subsection 2.6.1 for n = 1 but calculating an 
extension of the data function and its norm may need some work. 

Theorem 85 Let w he an extended B-spline weight function on with parameters n and I. Suppose 

is an open, bounded set which permits one of the extension operators £q described in parts 5 to 7 
of Remark 82. Now if C is any open cube containing Vl and < e < dist (Q, C) then there exists a 
continuous linear mapping E : i?" (Q) — > X° and a linear mapping B : (Cl) — > such that: 

1. E = Efi-g + B, where Eq-^ : H" (fl) H" is the extension mapping defined by 2.70, 

2. supp Bf C C ° and supp Ea-gf c o-nd, 

3. E is an extension mapping. 

4. E = ki^d E (-1)'^ {^e)T^^En;e, where h^d = {-if (f)"'^ and, 

I3<21 
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5. \\Ef\\^^^<\ki,a\{'2mf''\\D'-^En;ef\\^- 

Proof. Parts 1, 2, 3 The relevant commutative diagram is 2.73. Let c be the centre of the cube C and 
choose an integer m > such that (C — c) /m C [—1, 1]''. We have (TmTc {^) = (O — c) /m C [—1, 1]''. 
Set fi' = (O — c) /m. Choose / e i?" (fi). By part 3 Lemma 83, (Ji/mT~cf € i?" (O') can be extended 

by the operator E^i^^i^ to as a function g G i?"Q ^(—1,1)''^ which has support in ^'^j^- Lemma 

84 therefore implies /cj,^^!'-'-^ is an extension of cri/^T-cf to X° . Part 2 Theorem 77 now implies that 

Tccrmh,d5f^ 9 e Xl. Set 

E f = Tc(Jmh,d5.2 EQi^simUi/mT-cf- (2-71) 

Since < e/m < dist ^(—1, 1)''^ ^, part 3 Lemma 83 and then part2 Lemma 84 allow us to 

write 

Ef = TcCTmh^^f^O-l/mT-cEn-ef (2.72) 

= TcCTrn {I + A21) CTilm.T-cEu;ef 
= En-ef + Tc<ym-A2l(Tl/mT-cEQ,-ef ■ 

Now set 

B = TcamA^2l'7l/mT-cEn-^, 
SO that SUppBf C T-cCTi/m (supp A2iai/mT-cEn.ef) C T-cCTi/m I]'') ) 

C C'' and thus is an extension. 

Regarding continuity consider 2.71. By part 1 Lemma 83 the operator ai/^T-c ■ (O) — > if" (Q') is 
continuous, the extension operator from iJ" (fi') to 

iJ^Q ^(—1,1)''^ is continuous by part 3 Lemma 83, : iJ"Q ^(—1,1)''^ is continuous by 

Lemma 84, and Team ■ -'^{i ^ -'^S is continuous by part 2 Theorem 77. 

H: (Q) - ^ ((-1, l)'') 

exten | exten | h^d^f^ (2.73) 

w w 

Part 4 Use equation 2.58 to calculate and then substitute into 2.72 and compose the translations 
and dilations. 

Part 5 If Ms even we start with 2.79. Using the estimates 2.61 and 2.59, for / e i?" (fi) 

= A;i,dm('-"+i/2)'^2" WD^^aymT-cEn-.efW^ 



ki,di2mf\\D''^En-ef\ 



2 ■ 



Definition 86 The restriction space X° (ft) 

Suppose that w is a weight function with property W2 and suppose O c M** is a open set. 

Define the vector space X^ (fi) = {r^u : u € X^] , where r^u denotes the action of the linear operator 
which restricts u to fl. Endow (fl) with the norm 

II^L,o,n=inf{NL,o:«e^S, rnu = v]. (2.74) 

Clearly \\rnu\\^ q q < ll'^lltoo ''^^ • -^w ~^ (^) continuous. This implies r^^v is closed and 
.0,0 = dist {0,r^\) it follows that ||t;||^_o,n = II^L.o for « unique u € X^ and M^^q^q is 



since \ v\ 



actually well-defined and a norm. 
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Remark 87 The dilation Q-'^l'^a^ : X^, (^1) X°^^ (cjc^^) is an isometric homeomorphism. 

Corollary 88 Let w be an extended B-spline weight function on M'' with parameters n and I. Suppose 
Cl is an open, bounded set which permits one of the extension operators £q described in parts 5 to 7 of 
Remark 82. 

Then -ff" (ft) ^ (fi) and this inclusion is valid for any scaled extended B-spline weight function 
with appropriately scaled Cl. 

Proof. In Theorem 85 a continuous extension E : ^ was constructed for the extended B-spline 
weight functions w so that by Definition 86, r^E : (51) ^ (ft) is a continuous inclusion. An 
application of Remark 87 now proves this corollary for scaled extended B-splines. ■ 

Observe that the local space Xj;', (O) can be regarded as the set of data functions for the minimal norm 
interpolation problem where the data is c;ontaincd in il, and Corollary 88 proves that if" (17) C X^ (f2) 
when w is a scaled extended B-spline weight function with parameters n < I. The minimum norm 
interpolation problem can be defined in terms of extensions of these spaces to M.'^ and the matrix equation 
for the interpolant only uses values of the data function on 17. Thus H" (O) can be used as a source 
of data functions for the interpolation problem when w is a scaled extended B-spline weight 
function. 

The last corollary will now be strengthened to characterize the data functions for the minimal norm 
interpolation problem for the scaled extended B-splines. 

Lemma 89 Suppose w is a scaled extended B-spline weight function with parameters n, I, and Q CM.'^ 

is a bounded open set. Then X^ (17) ^ H" (Jl). 

Proof. Assume first that w is an unsealed extended B-spline weight function. Suppose v & X° (17) and 
let be any extension of v to X^. Noting that n < Z, if a < n we have 

^ r'l = / —. ^ y (^)l ds= s «— y {s)\ ds 

J (smsj) J (sm Sj) 

/„2(nl-a) 
s'" , , \v'{s)rds 
(sinsi)'*"'""^ ' ' 

> / |w«(s)|'ds 



Ja 
Jn 

— II n"i)ii^ 

— ll-'^ ^ll2,n ' 



and so D"v € (17) for a < n, which means v € H" (17). Further, the definition of the norm on 
X° (17) implies ||Z)"t;||2 q < ||?^*||^ q n when a < n, verifying that the inclusion of (17) in X° (17) is 
continuous. 

An application of Remark 87 now proves this lemma for scaled extended B-splines. ■ 
Combining Lemma 89 and Corollary 88 we obtain our local characterization of the interpolation data 
sets: 

Theorem 90 Suppose w is a scaled extended B-spline weight function with parameters n, I and 17 C M.'^ 
is open, bounded set which permits an extension operator Sq described in parts 5 to 7 of Remark 82. 
Then (17) = X° (17) as sets and their norms are equivalent. 



2. 7. 1 Extensions of TheoremQO 

In Theorem 90 it was shown that i?" (17) = X° (17) where w is a scaled extended B-splinc weight function 
with parameters n, I and 17 C M'' is open, bounded and satisfies the uniform cone condition e.g. §4.4 
Adams [1] . Here we will extend the class of weight functions to which this result applies, although it still 
essentially only applies to tensor product weight functions. 

The operator (Jl'-*- has the property that 5|'-'-m(^) = (sin^fe)^'"*^ w (C) so that when multiplied by the 
weight function it's zeros can be used to cancel out the weight function poles, as in part 1 of Lemma 84, 
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leaving a function of polynomial increase of at most n. This ensures the continuity of (5|'^ : H'J- X° ; 
so (sin^fc)'"^ corresponds to g below and motivates condition 2.75. It was also shown in Lemma 84 that 

/ e H'Jq (^{-1,1)'^^ implies h^dSf^f = f + A2if, where supp>l2;/ C ^[-l,!]"^^ , and this prompted 

2.76, where 62^^ corresponds to g{D). 

Suppose fif is a bounded function for which there exist constants > such that 



When u e iJ" we have u G L"^ C S' and u G C Lj^^ and so boundedncss of g implies the pseudod- 
ifferential operator g (D) = (gu)^ maps to continuously with Hff (-D) q = ^ J w \gf \u 
max\/Ca |||m|||^_2- assume that 



2 < 



supp(<?(l?)/-/)c / e ifro ((-1, 1)') , (2.76) 

which means that g {D) : H^q {^~^'' "'"^'*) ~^ ^ continuous extension. The commutative diagram 

for the construction of the extension iJ" {Q) — > is now 2.77 cf. diagram 2.73. 

exten | exten | g {D) (2.77) 

-^w ■* 

From the diagram we see that am '■ X^ — > X^ must be continuous where m > is an integer. This was 
proven for the spline weight functions in Theorem 77 and inspection of this proof shows that continuity 
still holds if 

w{^/m)<cmw{0, ^eR^ (2.78) 

for some constant Cm > 0. However, m must be chosen so that (C — c) /m C [— 1, 1]'' where c is the 
centre of a cube containing Cl. Thus we have the following extension to Theorem 85: 

Theorem 91 Suppose: 

1. w is a weight function on R"^ . 

2. fl is an open, bounded set which permits an extension operator £a described in parts 5 to 7 of 
Remark 82 and let C be any open cube with center c containing Q.. Set < e < dist (O, C"^). 

3. there exists a bounded function g with properties 2. 75 and 2. 76. 

4- there exists an integer m > for which 2.78 holds and (C — c) /m C [—1, 1]''. 

Then there exists a continuous extension mapping E : iJ" (O) — > X^ and a continuous mapping 
B : (n) X^ such that: 

1. E = E^i-s + B, where Sq;^ : if" (ft) is the extension mapping defined by 2.70, 

2. supp Ea-gf c fie o,nd suppB/ dC " . 



-d 



3. E = ki^d E (-1)'^ { 0)^20-21) /^En-e, where ki,d = (-1)" ( , ) and, 

i3<2l 

4. ||i;/|L,o < (2m)'" \\D^^En-j\\2. 

We now have an extended version of Corollary 88. 
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Corollary 92 Let w be a weight function and fl be an open set with the properties described in Theorem 
91. 

Then i?" [0) ^ (O) and this inclusion also valid for any scaled weight function with appropriately 
scaled il. 

Proof. From Theorem 91 there is a continuous extension E : ^ X'^ so that by Definition 86, 
mE : (Q) — !■ X° {n) is a continuous inclusion. An application of Remark 87 now proves this corollary 
for scaled weight functions. ■ 

The next result characterizes the data functions locally. 

Corollary 93 Suppose the assumptions of Theorem 91 hold and we also assume that there exist constant 
Cq, > such that 

w (C) > CcC^", ^&Q. anda<n. (2.79) 
Then X^ (f2) = iJ" (f2) as sets and their norms are equivalent. 

Proof. Inspection of Lemma 89 reveals that it still holds when the weight function satisfies property 
2.79. Thus Xl (O) ^ (O) and by Corollary 92, X° (O) = (O) as sets with equivalent norms. ■ 

Remark 94 // the inequalities 2. 75 and 2. 79 in Theorem 91 and Corollary 93 are replaced by ones in 
which the multi-index has range defined by \a\ < n instead of a < n then both the theorem and the 
corollary are still valid but with (Cl) replaced by the Sobolev space (O) and H^q (O) replaced by 

The Sobolev splines 1.15 are a case in point: here w (^) = ^1 + |^|^^ . // we set g = 1 then by the 

identity of part 6 Definition 221, conditions 2.75 and 2.79 are satisfied for Ca = Ca = (^), clearly 2.76 
is true and finally 2.78 is true for Cyj = 1, fl = 'R'^. Thus (Q) = H" (Q) which confirms the result of 
Example 1.3.3. 
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Central difference weight functions 



3.1 Introduction 

To understand the theory of interpolants and smoothers presented in this document it is not necessary to 
read this chapter which introduces the central difference weight functions. However, we note that Section 
3.5 contains local data space results specifically designed for tensor product weight functions and that 
central difference weight functions are used as examples in the chapters dealing with smoothers. 

This chapter introduces a large new class of weight functions based on central difference operators and 
it is shown that they are weight functions. The basis functions are calculated and smoothness estimates 
and upper and lower bounds are derived for the weight functions and the basis functions. 

Like the B-splines, their data functions are characterized locally as Sobolev-like spaces. 

Pointwise convergence results are derived for the basis function interpolant using results from Chapter 
2. These give orders of convergence identical to those obtained for the extended B-splines in. This chapter 
presents no numerical interpolation experiments. 

This is a large class of weight functions and I have only calculated a couple of basis functions. The 
calculations can be tedious so one really needs to be able to have theoretical results which enable you 
to tell beforehand that the basis function has some especially good properties regarding interpolation - 
I have no results to offer in this regard except Remark 112 which might mean that it may be possible to 
usefully characterize the basis functions or characterize a large subset of the basis functions. 

The central difference weight functions are defined as follows: Suppose q G (R-*^) , g ^ 0, g (^) > 
and I > n > 1 are integers. The univariate central difference weight function with parameters n, I is 
defined by 

<r2n 

where A21 is central difference operator 



For example, A2/ (^) = — (/ (0 " 2/ (0) + / {—£,)). The multivariate central difference weight function 
is defined by tensor product. 

It is shown in Theorem 98 that w belongs to the class of zero order weight functions introduced in 
Chapter 1 for some k iff / |^|^"~^ Q (0 < 00 for some R > 0. Here k satisfies k + 1/2 < n. The 

l?l>-R 

central difference weight functions are closely related to the extended B-splines defined by 1.20 and a 
discussion of their genesis is given in Subsection 3.2.1. 
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Various bounds are derived for w. For example, in Corollary 109 it is shown that if /|j|>^ i^'? (t) dt < oo 
then for any r > there exist constants Cr,c'^,kr,k'^ > such that 

c c' 

By Theorems 111 and 115 the univariate central difference basis function is 

G (s) = (-1)('-") J (d^C-") (*A)') (0 q {t) dt, seR\ 

with G e C|j^""^^ and D'^"'-^G bounded in C^^""!) (M\0). The multivariate basis function is defined 
as a tensor product. For fc < 2n — 2, D'^G is uniformly Lipschitz continuous of order 1 (Theorem 118). 

In Section 3.5 we will characterize the data function space locally as a Sobolev-like space 5. A local- 
ization result specifically for tensor product weight fmictions is derived in Theorem 123. This supplies 
important information about the data functions and makes it easy to choose data functions for numerical 
experiments. 

Results for the pointwise convergence of the interpolant to its data function on a bounded data domain 
are derived using results from Chapter 1. These results are summarized in Table 3.1 and give pointwise 
orders of convergence identical to those obtained for the extended B-splines. 

This chapter does not present the results of any numerical experiments. 



3.2 Central difference weight functions 

To introduce the central difference weight functions we will need some central difference operators: 

Definition 95 The univariate central difference operators 6^ and A21 on 
For a continuous function f we define the central difference operators 

S^f{x)=f(x + ^]-f(x-^), x,iGR\ 



and 

A2//(0= E (-i)'(fe+i)/(-fcO, / = 1,2,3,..., ^elR^ (3.1) 

k=-l 

This definition of A21 ensures that A2/,j (e~*^*) > 0, which is part 3 of the next lemma. For example: 

A2/(C)=-(/ (0-2/(0) + /(-?)), 
A4/ (0 = / m - 4/ iO + 6/ (0) - 4/ (-0 + / {-20 ■ 
Lemma 96 The central difference operators of Definition 95 have the following properties: 

1. A^JiO = {sff) (0). 

2. Regarding monomials and polynomials: 

(a) A21 (e^™) = ^ (-1)'= {^l^)k^A when m > 0; 



0, if m < I, 

L)'(20!, ifm. 

(c) A21 (e^^+i) = 0; 

(d) A21 (p(C)) = when p is a polynomial with degree less than 21; 

(e) A21 (C'O =(-!)' (20!^''; 



3. A2i,5 '^*) = 2^'sin^' (C*/2)- Here A2;,j indicates A21 is acting on the variable ^ in e 
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3.2.1 Motivation and definition 



The derivation of the central difference weight function is based on the approximation of the reciprocal 
of the B-spline weight function Wg using a moUifier. Prom 1.20 we have for parameters n and I 

1 sin^'^ , 

^ l<n<l. 



- — ^ J so that by part 3 Lemma 96 with t = 2, = ^'^a^ — - or using 

the Fourier transform [•] (^) 

1 ^ {2n)'^^ A2iF[5{x-2)] 

We now want to approximate the delta function 6 in the distribution sense using the sequence of 
mollifier ktp (kx) where k = 1,2,3, ip G (^^) ' ^^'^ / V' = 1- Since kip {kx) — > 5 as — > oo in 

the sense of distributions, kip {k {x — 2)) — > 5 (a; — 2) and so as — > oo 



2.i,,,F[kHHx-2))] , 1 ^^^,(^.^0). 



22' ^2n y,^ 

Here we will not try here to extend the convergence to 5' but observe that we can write 

^ A21F [k^ {k {x - 2))] _ A2iqk 



22/ ^2n ^2n 



where 



Ik (x) = ^kiP {k {x - 2)) G (Ri) and qu > 0. 



Thus w^{^) '^'^ {^^\^) f-iid if wc arc lucky A2fgi:(C) ^^■^ ^ weight function. Indeed, we 

use this idea to define a central difference weight function and justify calling it a weight function using 
the next theorem. 

Later in Section 3.4 we will show that if an extra condition is imposed on the function ip then — > 
^ ^^^^ in and as tempered distributions, and that the corresponding sequence of central difference 
basis functions converges uniformly pointwise to the extended B-spline basis function. 

Definition 97 Univariate central difference weight functions 

Suppose that q & i^^)' 9 7^ 0; 9(C) ^ and l,n>0 are integers. The univariate central difference 
weight function is defined by 

A2iq{K,) 

In d dimensions the central difference weight function with parameters n and I is the tensor product 
of the univariate weight function. 

The next result justifies these definitions. 

Theorem 98 Suppose w is the function on M.^ introduced in Definition 97. Then w is an even function 
satisfying weight function property Wl for A = {0}. Further, w satisfies property W2 for parameter k 
iff I and n satisfy k + 1/2 < n < I and q satisfies 



I 



< 00, for some R>0. (3.3) 



\i\>R 

Proof. The weight function w is an even function since it is clear from equation 3.1 that A2iqis an even 
function. Prom part 5 of Lemma 96 we have A2;,|e~'^* = 22' sin^' {^t/2) so that 

= 7Sjt/™"K'/2)9W<i*- (3.4) 
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From Theorem 98 we have w > when ^0. Since q € impUes q€ Cg \ from the definition of 
w we have w e C^°^ (R^\0) and w; > on M^\0. 

Regarding property W2, choose A > 0. Again using Theorem 98 

where, because the integrand is non-negative, the integrals all exist iff the last integral exists. Now 
make the change of variables r] = so that 



/ 



W (^) V2^F J J fj2{n-X) 'II W 



22((-Ti + X)- 



Since Ws is an extended B-spline weight function the first integral of 3.5 exists iff < A < k and 

K+1/2 < n < I. Further, since q G L^, J q (t) dt exists iff / q (t) dt < oo for 

|t|>fi 

some i? > iff / \tf^"~^^~^ q {t) < oo for A = and some R> 0. Thus it can be seen that w has 

\t\>R 

property W2 for K iff k;+ 1/2 < n < Z and / 9 (t) < oo. ■ 

\t\>R 

The central difference weight function is related to the extended B-spline weight function as follows: 

Theorem 99 Suppose Wg is the extended B-spline weight function with parameters n, I. Suppose Wc is a 
central difference weight function with parameters n,l,q{-) and property 3.3. Then these weight functions 
are related by the equation 

^ ^ ^ t^''q{t)dt. (3.6) 



Proof. From equation 3.4 of the proof of Theorem 98 



1 2 /'sin2/(^i/2)g(f)rft=2!l^ / sin (eV2) 2n (^)^^ 

- J 



3.2.2 Weight function bounds and sm,oothness 

We will be mostly concerned with smoothness properties and deriving lower bounds for the weight 
function. It turns out that the weight function is continuous except at the origin where it may have a 
singularity. 

The next theorem will give some properties of the central difference weight function if condition 3.3 is 
strengthened to 3.7. 



Theorem 100 Suppose w is a central difference weight function with the properties assumed in Theorem 
98. Suppose in addition 

j (t) rft < 00. (3.7) 

|t|>fl 

Then l/w € C^°^ (Ri) and 



y 1 _ / 0' if n<l, 

^Jt^-q{t)dt, ^f n = l. (^'^^ 



Also, there exists a positive constant Ci such that 



^ e^O. (3.9) 



w (0 " ' 
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Proof. From Theorem 98 we know that w is continuous and positive outside the origin. Thus 1 /w is also 
continuous and positive outside the origin. We now show that 1/w is also continuous at the origin by 
proving the limits 3.8. The existence of the integral / i^"^ (t) dt implies t'^^q {t) G and soqG 

\t\>B. 

by part 2 of Lemma 13. Hence the Taylor series with integral remainder of Appendix A. 7 can be used 
to expand each term of 3.1 about the origin up to a polynomial of degree 2n — 1. By part 3 of Lemma 
96 A^' annihilates polynomials of degree less than 21 and so: 



A'^m = E (-1)'" ikim-ko 

k=-l 

= E (-')"" (.;,)E^^M«)'+ 

k=-l j<2n •' 

+ (-1)'" (/i) (^^, [ (^'9 * 

j<2n ■' k=~l 

+ E (-1)'" LI) ^0^, f ^^"-^ (^^"5-) {{s - 1) ko ds 
E (-1)'" 10Y)\ [ (^'"^) «^ - ^) 

^ E (-1)" (.+0^'" / s^--'{D'-^{{s-l)kOds. (3.10) 



(2n- 



fe=-; 



Thus 



A2'q(e) 



w; (0 C^" (2n 



-1^ E^ (-1)"^' Ll,)k'- [ s'-' (D^-q) {{s - 1) kO ds. 



Because D'^^'q e C^°^ we can apply the Lebesgue-dominated convergence theorem to the sequence 
{s2"-i [D^n^) ((1 - s) (-J )) }~ ^ to obtain 

hm = TTT^—TTT y (-1)"=' Ll,)k^" t s'""' lim {{s - 1) fcO 



0, ?;/ n < 

rt^^q(t)dt, i f n = l. 

where the last step used property 4 of Lemma 96. 

The next thing to prove is inequality 3.9. But from equation 3.4, 



w 



Thus 1/w is bounded outside the unit sphere. Since l/'w is continuous everywhere it now follows that 
1/w is bounded everywhere. ■ 

The next result places the generally stronger condition 3.11 on the function q in exchange for estimates 
of the weight function behavior near the origin. 
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Theorem 101 Suppose w is a central difference weight function with parameters n, I, q which satisfy the 
conditions of Theorem 98, and also suppose that 



j 

J\t 



t^^q{t)dt < oo. 



\t\>R 



Then 



lim 



[ t^^q [t) dt, n<l, 



(3.11) 
(3.12) 

(3.13) 



C-O W (0 C2(/-n) 2> 

and given a> there exist constants Ca,c'^> such that 

ca<«^(e)e'<'-"^<cl, |el<a. 

Regarding smoothness: w € C^^O (Mi\0) anrf w (0 C^^'""^ e C(°). 

Proof. Assumption 3.11 implies that qG C^''. Hence we can set n = Z in the proof of equation 3.10 of 
Theorem and obtain 

(0 = T^rrni E (-1)'" ikli)'^"' f (^''?) ((^ - 1) ^0 ds, 



or, since 



1 



: ^-2"A2'§-(^) for n<l, 



w 



(^) ^2(;-n) 



FoUowing the rest of Theorem 98 we obtain the hmit 3.12. 

To prove 3.13 note that since w is continuous, 3.12 imphes that there exists ai > such that 



1 



and so 



or 



Now choose 



1 



dt 



< 



t^^q (t) dt < 



< 



^jt''^q{t)dt, 1^1 <ai, 
jt''q{t)dt, 1^1 <ai. 



2V27r 



Jt^^q{t)dt 



Ca = 



2V27r 



Jt^^q{t)dt' 



Jt^^qit)dt' 



3Ca. 



If a < ai we are done. If a > ai we use the continuity of w and 3.9 to prove that there exist Ci , C2 > 
such that < Ci < w (^) < C2 for ai < |^| < a. The inequahties 3.13 then follow easily. 

Regarding smoothness, we noted at the start of this proof that q € C^'^ and consequently A^'^jt e 



(21) ^ 
Cg . Further, equation 3.4 implies that A2iq{^) > when ^ ^ 0. But w (^) 



C(2') (R\0). Finally, 3.12 of this theorem imphes that Hm 



exists and is positive. Thus 



and so w e 



lirn w (^) 5^'' exists and so the function w (^) ^^^^ is continuous. ■ 

A multi-dimensional analogue of Theorem 98 is: 

Corollary 102 Suppose the functions {qj}'^^i satisfy qj € Qj 0> Qj {^) > 0. Define the tensor 

product function w (^) = Yl ^21 for integers l,n>0. 

Then w is a weight function on M.'^. We call it a tensor product central difference weight function. 
Further, w satisfies property W2 Jor parameter k iff I and n also satisfy the conditions of Theorem 98 
i.e. K + 1/2 < n< I and J|j|>^ \t\ "~ qj {t) dt < 00 for some Rj and 1 < j < d. 
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Proof. That w satisfies property W2 for k since the tensor product of univariate weight functions having 
property W2 for k also satisfies property W2 for k. This is part 2 of Theorem 4. ■ 

Remeirk 103 The weight functions of Corollary 102 can he further generalized by choosing different 
values of n and I for each dimension. 

The next result gives some weight function properties which follow directly from Theorem 101. 

Corollary 104 Suppose w is a central difference weight function with parameters n, I, q which satisfy 
the conditions of Theorem 101. Suppose in addition that I = n. Then w satisfies: 

1. weC^°^ (Ri) nc(20 (Mi\0). 

There exist constants ci , C2 > such that: 

2- w (^) > ci for all ^ and, 

3. w (0 > caC^' for all 

4. lim w (0 = J t'-q it) dty\ 



3.2.3 Upper hounds for the weight function 



To derive upper bounds on central difference weight functions we first derive the weight function for the 
case when g is a rectangular function. 



Theorem 105 Suppose b> a and qn is the rectangular function 



1, ^ e [a, b] , 
0, ^^[a,b]. 



(3.14) 



Then q = qn satisfies the conditions of Theorem 98 for n = I = 1 and the corresponding weight 
function is 

'/27 e 



b-a _ 
2 



with bounds 



Proof. Prom 3.4 



6 (6 - a) 



e, 1^1 > ' 



2 (b-a) 



b — a 



(3.15) 



(3.16) 



The integral is an even function of ^ so now assume ^ > 0. We now use the change of variables s = ^t/2, 
ds = (^/2) dt with new range [a^/2, b^/2]. Hence 



J a 



sm —at 



K/2 



i Jai/1 



sin sds = - 



s sin 2s 
2 4~ 



2 fb 



a 



64/2 

sin b^ — sin 



h — a sin 6^ — sin a^ 



2? 



b-a 1 



i sin 



so that 



1 fb-a 1 



(0 \ 2 C 



i sin 



b — a 
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as required. 

Regarding the inequality 3.16, if |^| > then < and 



2 « 



(VihKTii) " 8 8 (6-a)/4 



27r 



2 (6 - a) 



Also, since ||| < ^ 



Corollary 106 T/ie weight function wr of Theorem 105 is a Cg'p (M^) function. 
Proof. We continue wr to as the function of a complex variable: 



and note that 



, , v27r 1 1 

Wr (z) = —-7T ^ ■■ T, Z G <L , 

' A (o—a)2— (sin oz— sin az) ' ' 





{h-a)z- (sin hz - sin az) _ {b^ - a^) {b^ - a^) z^ (b^ - a^) z^ 
z^ ~ 3! 5! 7! 

is analytic and non-zero in a neighborhood of the origin. The theory of division of a power series 
implies that wr (z) is also analytic in a neighborhood of the origin. Thus wr is C°° in a neighborhood 
of the origin. 

To prove that wr is C°° away from the origin we write 

V2jr 1 ^ fh- a 

(¥) 

where 



WR (0 = —5- ,u ( ^T-? ) ' 



/ii(0= ^ 



It is clear that cos f C) = 1 iff ^ = 0, so that fR G C°° away from the origin and hence that 



w E away from the origin. Finally, since we can write (C)| ^ we have w G C'^p. ■ 

The previous theorem will now be applied to derive an upper bound for the central difference weight 
functions with parameters n = I = 1. 

Corollary 107 For the case n = I = 1 the central difference weight function w (^) = ^^^^^^^ of Theorem 
98 satisfies an inequality of the form 

w{^)<cwR{i), iG^}. 

where wr was defined in Theorem 105. 

Proof. By assumption q £ (^^)' 9 (0 ^ fo'^ ^ GM.^. Therefore, since an intcgrable function can 
be defined using simple functions i.e. the characteristic functions of intervals, there exist constants b > a 
and c > such that q (^) > cqR {£,) for ^ e [a, b] where qR be the rectangular function defined by 3.14. 
Then from 3.4 and Theorem 105 

J(^)=^4J 1^ ''^M^ [ 1^- w = U.R (0 ■ 

Ml 
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For an arbitrary central difference weiglit function inequality 3.9 gave an order 2n lower bound on the 
growth of a weight function at infinity. We now derive an upper bound at infinity which is also of order 

2n. 

Theorem 108 There exist constants < a < b such that the central difference weight function w (^) = 
— of Theorem 98 satisfies inequalities of the form 



^{0<{f^n (3.17) 



A2i«(«) 

where c\ and C2 are given by equations 3.21 and 3.23 respectively. 

Proof. The proof will involve calculating a lower bound for 1/w. Choose < a < 6 and cq such that 
w {C) > Co when ^ e [a, b]. Set 5 = b — a. We start with equation 3.4 so that 

Since w is an even function we can assume ^ > 0. Applying the change of variables s = £,t/2 gives 

\ > ^''po J / sin^isds. (3.18) 
Now b^/2 - = 6^/2 = kn + an for some < ct < 1, and 

I sin^' sd= I sin^' sds + / sin^' sds 

Jai/2 >/a^/2 J ai/2+kv 

fkir /■a^/2+(T7r 



/ sin^' sds + / sin^' sds 

Jo Jai/2 

rT^/2 pa^/2+{77z 

2k / sin^' sds + / sin^' sds. 

Jo Jaf/2 



<I2 



Since 6^/2 = kir + air implies 6^/2 < (fc + 1) tt and ^ - I <k, 

K/2 f^/2 /c^ X .7T/2 

sin^' sd > 2fc / sin^' sds > ( -f - 2 ) / sin^' sds. (3.19) 

a{/2 Jo 

We now split the range of ^ into the intervals: 0<^<|-,-|<^<^ and ^ > ^ and consider each 
as a separate case. 



TT 



Case 1: C > X 



Here A; > 2 and | < ^ so that from 3.18 and 3.19 

> 2!^^ J_ sin'i sds>t^J^(^-^-2\ sin^' sds 



^2(l-n)yj (^) - ^21+1 J^^^^ - V2^ ^21+1 ^ ^ 

o21 + l„ 1 



\/2¥ f2l 



1 5 r'^ 



7r/2 

sin^' sds 



sin^' sds 



V2^ ^2i27ryo 
1 (20! (5co 

2\/2^ ^^1^2 ^21 ' 
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and thus 
where 



4-7r 



ci = 2V27r 



1 



(20! (6 -a) Co 



Case 2: < ^ < f Here fc = and - = 5^/2 < Tr/2 so that sin s> and 

bC/2 /2 \2i 2' r g2Z+l -|''?/2 



5J/2 
«/2 



sin^' srfs > 



Thus by 3.18 



1 



> 



^2((-n)y^ — \/2¥ ^2i+l 



«/2 



2^^ 2 V TT / 2? 



sin^' sds > 2 £0 



> 



i_ (^2i+i _ ^2;+i^ 



V27f 



2' ^2/+l _ ^21 + 1 



2/ + 1 



-Co, 



and 
where 



C2 



^2(Z-n) ■ 



o<e<^, 



C2 



^277 



2i (•j,2;+i _ „2l+\ 



) Co 



2/ + 1 



Cas(> < L < ^ 



Here 



/■6{/2 /.7r/2 

/ sin^' sds> I i 

Ja(/2 Jo 



so that by 3.18 



> 



22' + ico 1 



271- tf2;+i / 

S J w 



bi/2 



sin^' sds. 



221 + 1 



1 



sin^' sds > , 
4/2 ^^'^ Jo 



7r/2 



sin^' sds 



- + icn 1 ^ (20! 



_ 2"' + -^ CO 



^2;+i 2 22/ 
2^ (20! Co 



2 (i\fe'+^' 



Thus 



and 



,..^< 2 (l!fe'"+i 2 (nf e'"4^_./^(Z!)' 1 ,2„_2, 
"^^^-71^M!^-71^M!^T-'^''^(20!^^ " 



Noting that w is an even function the inequaUties 3.20, 3.22 and 3.24 imply 3.17 as desired. 
We have the following results for central difference weight function near zero and infinity: 



(3.20) 
(3.21) 



(3.22) 
(3.23) 



(3.24) 



Corollary 109 Suppose w is a central difference weight function on with parameters n, I, q (•) which 
satisfies the condition 3.11. Then for any r > there exist constants Cr,c'j.,kr,k'^ > such that 

kre''<w{o<Ke^, \^\>r, 



and 



^2{l-n) 



^2{l-n) ■ 



1^1 < r. 



(3.25) 
(3.26) 



3.3 The central difference basis functions 
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Proof. Inequalities 3.25 follow from inequality 3.17 of Theorem 108 and inequality 3.9 of Theorem 100. 
Inequalities 3.26 are the inequalities 3.13 of Theorem 101. ■ 

Later wc will be interested in the sc^quence of central difference weight functions {wk} which corre- 
sponds to & = > and a = —k^'^ where /c is a positive integer. These weight functions have the 
following properties: 

Corollary 110 Suppose k > 1 is an integer and R is the unit rectangular function with support on 
[-1,1] c R\ 

Then qt {£,) = ^R{kS) satisfies the conditions of Theorem 98 for n = I = 1 and the corresponding 
central difference weight function is 

where 

Proof. The function ^R{k^) is a rectangular function equal to | on [— ^, ^] • Thus a = — and & = 
so that by Theorem 105 

k i k ''"^ k 



3.3 The central difference basis functions 

We will now determine the basis functions of zero order generated by the central difference weight 
functions of Definition 97. In Theorem 29 it was shown that a univariate extended B-spline basis function 
Gs is such that Gg e Cg^" (K"^) and D'^"^^Gs is a piecewise constant function. In the next two 
theorems wc will prove analogues of these results for the central difference basis functions. In fact we 
show that a central difference basis function Gc is such that Gc G C^^"~^\ D'^^~^Gc G C(°) (lR-^\0) and 
£)'2n-iQ^ is bounded a.e. 

Theorem 111 Suppose w is the tensor product central difference weight function defined in Corollary 
102 and set 

Ga = (-1)('-") D^C-") (*A)' . (3.27) 

d 

Then this weight function generates the tensor product basis function G{x) = Y\ Gi {xi) where 

Gi{s)= [ GA(^)\tf''-'q{t)dt, sGR\ (3.28) 
Gi e C^^""^^ and Gi is an even function. Further, for 1 < k <2n — 2 

D^G, (s) = J [sgntf {d'^Ga) (j) q {t) dt, s e R\ (3.29) 

Finally, for < t < k 

D''Gi(s) <^-^ , 1?° ' "\ seM^O. 



Here Ga is given by 3.27. 
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Proof. We first note tfiat from Subsection 1.4.4 tfie multivariate basis function will just be the product 
of the univariate basis functions. 

Because of the multiplicity of variables involved we will use the notation F[] for the Fourier transform. 
We want to express 1/w as a Fourier transform. Continuing on from equation 3.4 



w 



2,1 

1 = 1 1 / sin^' (t) dt=^ [ (Ci)'('-") it) dt 



21 |t 
2 



= / F^. 



Now 



w{0 



1 

/2tt 



{0\t\^^-\{t)dtdt 



(3.30) 



and we want to use Fubini's theorem to change the order of integration. Indeed 



j j\e-^^^F,[GK[^\{m?''-\{t)\d^dt< j j 



Ga(t 



(C) d^ \tf''-^q{t)dt 



1 

V2tt 



1 

V2^ 



\F[GA]m\ \t\d^ \t\^''-'q{t)dt 



sin"' {Ct/2) 2n-l . 
—md^ \t\ q{t)dt 



sin"' (77/2) |,|2n-l /.s,,, 

2:^d'n \t\ q{t)dt 



(^/2)^ 



/ F[Gf.\{n)d^ \t\^--\{t)dt 



= Ga (0) 



q{t)dt, 



so the order of integration can be reversed to give 



Gi (s) 



j e-^««F, [Ga (^)] (0 rfC lil'"-^ « (0 = I Ga (^) \t\'-' q it) 



dt. 



as required. 

By Theorem 98 w is even, flence Gi is even since w is real- valued. Also by Theorem 98, k + 1/2 < n 
so that [2kJ < 2n - 2. But by Theorem 24 Gi e C^^^"-'^ and so Gi e C^^""^\ 
To prove 3.29 we start with the equations 3.30 for Gi. If fc < 2n — 2 



Ga(t 



An argument similar to that used to change the order of integration in the case k = allows us to 
write 

(^) = vfe / / [(^'Ga) (0] (0 d^ t-^ q (t) dt 



t-^\t\^'^-^ q(t)dt 



= I {.gntfiD'G^) (^Itr-'-'qit) 



dt, 
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as required. 

Again from 3.29, since D'^Ga is bounded with bounded support, for any < r < A; 



so that 



l^rlD'^Gi (.)|< I \\s\^iD''GA) (0 



\t\''''-''-'q{t)dt 



\f''-''-'+^q{t)dt 



< \\\x\ 



tf''-''-'+'q{t)dt 



\,\2n—k— 1+T 

1*1 9 



|||xr£)'=GA|| 
ID'^G, {s)\<— 



Remark 112 The expressions 3.29 and 3.28 are multiplicative convolutions and exponential substi- 
tutions would convert them into additive convolutions. Then, with appropriate modifications, Fourier 
transforms could be used to express q in terms of the basis function and perhaps even enable us to 
characterize the basis functions. 

However, we could instead use the theory of Mellin transforms and the fact that the Mellin transform 
of a multiplicative convolution is the product of the Mellin transforms. 



function w (^) = ^1^^^^^ with property W2. Suppose also that q decreases faster than any power of \-\ at 



Corollary 113 Suppose Gi is a one- dimensional central difference basis function generated by a weight 

pin 

function w (^) = ^1^^^^^ with property W2. S 
infinity. Then for < r and < fc < 2n — 2 



iD'^Gi (s) < ^ 



G Ri\0. 



For the next theorem we will prove the following version of Leibniz's theorem for differentiating under 
the integral sign. 

Lemma 114 (Leibniz) Suppose we have a function / : — > C and a point sq € M. Suppose f has the 
following properties : 

1. In some neighborhood ofM (sq) of sq and for almost all t, f (s, t) G C^°^ and Dif (s, t) is piecewise 
continuous. 

2. For each s e J^{so), /(«,-) € and Dif{s,-) e 

3. There exist constants c, e > such that 

\Dif{s,t) - Dif{sQ,t)\ dt<c\s- Sol , \s - Sol < e. 



/I 



(3.31) 



Then we can conclude that in some neighborhood of sq, -Di / f {s,t)dt = J (-Di/) {s,t)dt and this 
is a continuous function. 

Proof. The conditions of parts 1 and 2 imply that for each fixed t, f (s, t) is absolutely continuous as a 
function of s in a neighborhood of sq. This absolute continuity enables us to write 



f is,t) - f {so,t) 
h 



- {D,f) (so, t)]dt = ^j j'^ {{D^f) (w, t) - {Dif) (so, t)) du dt, 
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where h = s — sq. Condition 3 of this lemma allows us to use Fubini's theorem to change the order of 
integration in the following calculation. Assuming that s > sq 



/( 



h 



{D,f){so,t)]dt 



1 

< - 
- h 

_ 1 

~ h 



So 



\{D,f) {u,t)-{D,f) {so,t)\dudt 
{u,t)-{Dif) {so,t)\dtdu 



< T (w - So) du 

^ Jso 

= I (s - So) • 
The same estimate can be obtained when s < sq- ■ 

We have shown that the basis function lies in C^" Now we will use the lemma to consider the 

derivatives of order 2n — 1. 

Theorem 115 Suppose Gi is a one- dimensional central difference basis function generated by a weight 



y2n 

function w = ^21 ^(^^^ satisfying property W2. Suppose also that q is bounded a.e. i.e. fi 



.e. for some constant 



Cg > 



q {t) < Cq a.e. 



Then, if Ga is given by 3.27, 

D^^-'Gi is) = J sgn (t) (d2"-1Ga) (|) q (t) dt, s e M^0, 



(3.32) 
(3.33) 



D2"-iG'i e C(") (Mi\0) and D'^^'-'^Gi is bounded a.e. 
Further, for < r < 2n — 1 



(s) < 



\x-D^-^GA\\\\t-q\\, 



s e K^o. 



Proof. We will define 



f{s,t) = {D^--^GA) (I) \t\q{t), 



and verify that / satisfies the three properties of Lemma 114. First note that from Theorem 29, 

Ga G Cq^""^'' and Z3^"^^Ga is a piecewise constant function with bounded support. Also, since Gi has 
weight function property W2, q e and /|.|>^ q (t) dt < 00 so that / \t\ q (t) dt < 00. 

Property 1 Thus for each t^O, f {s, t) e G(°) and since 

D^f {s, t) = sgn it) (D^^-'Ga) (^) q (i) , (3.34) 

it follows that Dif {■, t) is piecewise continuous. 

Property 2 Since Ga e G^^""^\ £)2n-2^^ jg bounded and hence 

j \f{s,t)\dt<\\D^"-^GA\\^J \t\q{t)dt<^, 

and 

J \D,f{s,t)\dt< J\{D^"-'Ga) (^)\q{t)dt<\\D^^-'GA\\^Jqit)dt<oo. 

Property 3 Assume sq 7^ 0. Suppose supp£)^"~^GA C B (0;r) and set imin = max{|so| , |s|} /r. Then 
J \D,f {s, t) - D,f {so, t)\ dt = J\ (D^^-'Ga) (I) - (D^^'-'Ga) (^) I q {t) dt 

= I \{D'-^GA){^)-{D'-'GA){^)\q{t)dt. 

\t\>tmln 
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Since L*^" ^Ga is a step function with bounded support it can be written as the sum of a finite number 
of rectangular functions, say {afcUfel^^, so that 

J \D,f {s, t) - Dif (so, t)\ dt<J2 «fe J |nfc (^) - Hfc (^) I q (t) dt 

<||i52„-i^A||^^ / |nfc(|)-n,(|)|g(i)di. (3.35) 

I 1 " 



\t\>t„ 



Suppose Ilfc has support 



. Observing that 



z 

{x : Hfc (a;) - (x + /i) 0} C |J { 



X : 



a; — a; 



(fe) 



< 



\h\}, 



we have 



i=i 

2 



< 



£ _ £0 
t t 



7 = 1 



£o _ (k) 
t 



Next apply the further constraint |s — So| < |so| /2 so that = implies the corresponding set is 
empty. \i xi,X2 ^ 



So 
(fc) 



.(fc) 



— -'l -r -12 ^ 



with ^ We can now conclude from equation 3.35 that when |s — so| < l^ol /2 

J\D,f{s,t)-D^f{so,t)\dt<\\D^^-'GA\\^f2 J q=\\D^''-'GA\\^ J q, 



fe=l 



J(so,s) 



where, J (sq, s) = IJ [l{'ul^')= [j [ak-bk\s- sq\ ,ak + bk\s- sq\] with bk > 0, au 0, and 

k=l ^ ' fe=l 

^ J(so,s)- 

Now the additional assumption 3.32 we made about q comes into play. In fact 



o)c+6)c|s— Sol 



On+fcrils — Sol 



/fit ni, li nt 

q{t)dt = J2 / Q<CqJ2 / <Cg^26fc|s-So 

X k=l , I I fe=l .1 I fe=l 



J(so,s) 



afc-6fc|s-so| 



^Ti — fon|s— Sol 



= '^^<l[^^k] |s-So|, 



SO that 



J \Dif {s, t)-Dif (so, t)\dt< 2cg bk^\s- sol , 
when \s - so\ < \so\ /2 and sq ^ 0. This proves property 3 and hence 3.33 and D^^-^Gi G G^') (M^yo). 
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From 3.33 



|i)'"-'Gi(s)| < / {D^^-'Ga)(j) q{t)dt 

< ||_p2n-i^^||^y q{t) dt <oo a.e. 

Again from 3.33, since D'^^~^Gx is bounded with bounded support, for any < r < 2n — 1 
|5r|D2"-iGi (5)1 < j ||5r(D2"-iGA) [^\q{t)dt = J \\'-[ (D'^-'Ga) [^\\trqit)dt 

<\\\xriD'-'GA)\Ll \trq{t)dt 



= \\\xr d^^'-^ga 



IC^IIi 



3. 3. 1 Examples 

Example 116 In this example we show that the univariate function A (a;)^ is a central difference basis 
function with weight function w (^) = -^^^ ^ ^^in^ • 

Suppose I = n = 1 and q{x) = R{x) where R is the rectangular function with support [—1,1] and 
height 1. In this case Ga = A i.e. the hat function. Then the central difference basis function is given by 



G{x)= j A(^yt\R{t)dt = 2 J ^{^)tdt, 



so that G{x) —Q when \x\ > 1, since A (x) = when > 1. 
When \x\ < 1 



G{x) 



\x\ 



1- ^]tdt = 2 



f {t - \x\) dt = [i^ - 2 \x\ t] I = (1 - 2 1^1) - (\xf - 2 \xf) 

J\x\ 



= l-2|x| + 
= {l-\x\f. 



Thus G{x) ^ A {xf 

2 sin 

1 _ A2,g(0 _ (0 _ - (9 (0 - 2g (0) + g (-0) 



Since g(^) = definition of w 



w{0 



e 



2 2-2^ 
4 ^-sin^ 



and thus w {^) 



Example 117 Noting the convergence claims in the sentence before Definition 97 we will now construct 
a sequence of central difference basis functions which converges uniformly pointwise to the hat basis 
function. See Theorem 122 for a more general result. Here we choose the sequence 

Qk (t) = ^R (kt) , k=l,2,3,..., tGR\ 

where R is the unit rectangular function with support [—1,1]. Suppose I = n = 1 so that the central 

'erence basis function is: 

Gk {x) = /a (I) \t\qk {t-l)dt^ /^(i) \t\^R{kit-l))dt='^l^'y A{^)tdt. 
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The corresponding weight functions are derived in Corollary 110. 
We proceed by considering x in three separate sets: 

Case lQ<x<l-\. 

^ , , k f^+i , /x\ , k /■^+* x\ , k /•^+^ 



tx 



{t - x) dt 



k 
2 



1 + 



k 
2 



'4 



= {l-x) = A{x). 



'4 



Case 21- I < X <1 + Then 



Gk (x) 



A 



(i) 



k 
2 

2 

2 



{t - x) dt 



tx 



1 

'+k 



k 
2 



1 + 



1 + 



X + -X 



k 
4 



Case 5 1 + i < X. Clearly Gk (x) = 0. 
From the three cases we conclude that 

( A(x), 
G,{x) = \ + 



o<N<i-i 
i-f <N<i + i, 
1 + 1 <N, 



and Gfc e C (M^) n C^^) (M1\0) n PVFC°° (K^V {-1,0, 1}) . We can also write 

Gk {x) = A (x) + Ak {x) , 



where 



Afc {x) 



0, 

!(i-N + i)'- 
!(i-N + i)'. 

0, 

0, 

!(N-(i-i))' 
!(i + i-N)% 



(i-N 



0< < l-i, 
1 - i < bl< 1, 



1 < |a;| < 1 
0< |x| <l-i, 

i-i<Nl<i, 
i< W <i + i 



fe' 



(3.36) 



(3.37) 
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We note that 



SUPP A/; 



0<|Afe {x)\<^. 



3.3.2 Lipschitz continuity 

The next two theorems will show that the continuous derivatives of the basis function are uniformly 
Lipschitz continuous of order 1 on . We start with the univariate case. 

Theorem 118 Let w (^) = ^1;^^^ be the univariate central difference weight function introduced in 

Definition 97 and, which satisfies property W2. If Gi is the basis function of order zero generated by w 
the derivatives D^Gi {x), k < 2n — 2, are uniformly Lipschitz continuous of order 1 on M} . In fact 

\D''G,{x)-D'^G,{y)\ < \\D'^+'GA\\^\\t'^-'-'q\\,\x - y\ , x,yGR\ 

where Ga is given by 3.27. 

Proof. From 3.29 of Theorem 111 



D'^G^ (x) - D'^G^ {y)\<J\ (D^Ga) (I) - (D^Ga) (f ) I q {t) dt 



< 



D'^Ga) (f ) - (D'^Ga) (f) 



\t\^''-''-'q{t)dt. 



Since Ga is a constant scalar multiple of the extended B-spline, from Theorem 118 we have for 
dimension 1 and k < 2n — 2, 



\D''GAis)-D''GAis')\<\\D''+^G\\ \s-s'\, s,s' gR\ 



and thus 



ID'G^ {x)-D'^G, (y)| < Wd'^+'GaW^ j 



X _y 
t t 



< \\D''+^Ga\\ \\t 

— II II oo " 



2n-2-fe 



9 1 \x-y\ 



To deal with the multivariate ease we will require a lemma concerning a distribution Taylor series 
expansion and which follows directly from Lemma 42 Chapter 1: 



(0) 



Lemma 119 Suppose that G e Cg 
functions. Then 



and the (distributional) derivatives {-D"G}i . are bounded 



G{x)-G{y)= J2 i^-vT I {D^G){x-t{x-y))dt, 
Ia|=i 



\G (x) - G (y)| < max ||D"G||^ \x-y\. 
I«l=i 



(3.38) 



Now we can show that the continuous derivatives of the tensor product central difference basis functions 
are also Lipschitz continuous of order 1. 



Theorem 120 Let G{x) = Yl Gi (xk) be a central difference tensor product basis function with param- 

fe=i 

eters n, I, q{-), and q is bounded when n = 1. Then Gi e G^"~^\ £)^"~^Gi is bounded and we have the 
estimates 

l'^, (3.39) 



\G{x)-G{y)\<VdG,{0f-'\DG,\^\x-y\, x,y G 
and z/0 < |/3| < 2n- 2 



\Df^G{x)- D'^G{y)\ < Vd ( maxlDkO/^Gl 

y fe=i 



\x-y\, x,y€ 



(3.40) 



3.4 Weight and basis function convergence results 
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Proof. By Theorem 111, d e c|j^" and by Theorem 115, D^''-^Gi is bounded. 

Thus G e C|j°^ (R'') and the derivatives {D"G}, , , are bounded functions. Thus G satisfies the 
conditions of Lemma 119 and so the estimate of that lemma holds i.e. for all a;, y e M"^ 



|G(x) -G{y)\ < Vd ( ma^ \D'^G\Jj \x - y\ < Vd\Gif-' \DG^ 

\\a\ = l J 



cx>F- 



Finally, from 1.30, |Gi (a;)| < Gi (0) which proves 3.39. 

To prove 3.40 observe that if < fc < 2n - 2 then D^Gx e C'^2n-2-fc) ^j^,;^ D^^^Gx is bounded. 
Thus if < |/3| < 2n - 2 then D^^G e G^°^ (M<^) and the derivatives {B'^D^G}^^^^^ are bounded 

functions. Thus D^G satisfies the conditions of Lemma 119 and so the estimate of that lemma holds i.e. 

for all a;, y G 



1- , y ^ ii-i 

\D^G{x)-D<^G{y)\ <^d{^p\^D°'D^G\^ \x-v\ <\rd[xA^\D^,D^G\ 



OO 



\x-y\ 



3.4 Weight and basis function convergence results 

In the preamble to the definition of a central difference weight function in Subsection 3.2.1 several asser- 
tions were made regarding the convergence of sequences of central difference weight and basis functions 
to extended B-spline weight and basis functions. This section is devoted to proving these assertions. 



3.4-1 Pointwise basis function convergence 

The next lemma shows that the tensor product extended B-spline basis functions are Lipschitz continuous 
of order 1. 

d 

Lemma 121 (Theorem 43 Chapter 1) Let G{x) = H Gx {xk) he a tensor product extended B-spline 

fe=i 

basis function, as described in Theorem 29. We then have the estimate 

\G{x)-G{y)\<VdGx{Qt-'\\DGx\\^\x-y\, x^y&W'. 

Theorem 122 Suppose l,n > 1 are integers which satisfy 1 < n < I. Let G L^ (M^) be a function 
which satisfies tp (x) > 0, ^ (0) = 1 and 



/ 



\tf"tp{t)dt<oo. (3.41) 



Then the sequence of functions qk {t) = \/27r2~^'fcV' {k {t — 2)), fc = 1, 2, 3, . . . generates a sequence of 
central difference basis functions, say Gk, which converges uniformly pointwise to the extended B-spline 
basis function Gg with parameters I and n. In fact, there exists a constant C = C (Gg) which depends 
only on Gg such that 

o2n— 1/^ ((^ \ 

\Gk{x)-Gs{x)\< x€R\ 

Proof. It is clear that functions satisfy the conditions of Theorem 98 so that each fimction ^21-^^1^^-^ 
is a central difference weight function with parameters I and n. By Theorem 111 the corresponding basis 
functions are 



Gk (x) = j Gs (I) fcV {k {t - 2)) dt, 
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^ (-!)('-") o-2in2(i-«) 



where = 2-^'£)'^('-") (*A)'. Since / V = 1 it follows that 



Gfe (x) - (:r) = y (g, (|) - (x)) fc^ (fc (i - 2)) dt 

^ k 

= y (g, (I) - G, {x) fcV (fc - 2)) dt+ 

+ ' (Gs (a;) - Gs (a;)) yfc^ (fc (i - 2)) dt 

= (g, (I) - G« (x)) If "-^ fcV {t - 2)) dt+ 



f Gs{x) - l) A:V' (fc {t - 2)) dt, 



so that by the Lipschitz continuity estimate on derived for the extended B-spline in Lemma 121 



\Gk {x)-G, {x)\ < 



Gs{'-]-Gs (x) ^^"-ifc^ (k (t - 2)) dt- 



< c 









X 


L 


X 




t 



+ \\Gs 



/•i+i 
/ 



\Gs {x)\ t^"-^ - 1 ki) {k {t - 2)) dt 



t^^-^ki; {k {t-2))dt+ 



/ |i2"-i-l|A;V(fc(i-2))dt 

|l_t|f2n-2^^ (fc {t-2))dt+ 



|G, 



|t - li (1 + t + . . . + t2"-2) fc^ (fc (i _ 2)) dt 



^ c2^-'\x\ ^ ||G,||^2^»-i ^ 



and since G^ has compact support we have uniform convergence. 



3.4-2 Weight function convergence 

Recall that at the end of the Subsection 3.2.1 it was suggested that if € satisfies {x) > and 
J = 1 and an extra condition then the sequence of functions qk (t) = ^^-kip {k {t — 2)) e is such 

that ^ ^In^^'' converges to ^^^^ in the sense of tempered distributions. Here ^^^^ = ^ ^^^^ € where ti;^ 
is the extended B-spline weight function with parameters n, I. The extra condition referred to is 3.41 i.e. 

i|t|>fl 1*1^" * dt < 00 for some i? > 0, which is equivalent to / i/) (t) dt < oo since J tp < oo. This 
assumption is stronger than 3.3 which was the used in Theorem 98 to ensure that a central difference 
weight function has property W2. We will prove convergence and this implies convergence in 5". 



3.4 Weight and basis function convergence results 
Assume ^ > 0. Then using the change of variables: s = dt = jds 



Wk (0 



^ 22(i-r.) f 

^ J 



t^"qk it) 



dt 



I (t/lf kv ik il 2)) 



dt 



Ws {it/2) 



ds 



1 [2k ^ /2k 



(si 



,2i 



sm s — sm 



' i) ds+ 



+ ^jj^{ji^-0)sin-ids 



1 f2k /2k , ,2, . 2^.^ , sin^^ / 



sin s — sin 



■C) ds + 



sin-"' C 



so that 



Now split the domain of integration about |^| = R: since J tp = 1, 



\-\>R 



1 1 

Wk Ws 



< 

l«l>fl 

< 2 



/ ^ / f ^ (f ^) (* + - sin^' ^1 dtdi 



< 2 



= 4 



1 /■ 2fc , /2fc , , 

\i\>R 

|«|>7?, 

4 1 



2n - 1 ■ 
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If Gg is the B-spline basis function wc can write 



-^p(^j?j (sin^' (e + i)-sin''0 dt 



]_ r2k.f2k, 
2k /2k \ f sin^' i^ + t) sin^'S.^ 



^2n 

N2n 



e 



dt 



f 2k , f2k 

= J It' 

= E 



t 



2n 



t]{{l + -] G,{^ + t)-G,{0]dt 



2<m<2n 



m<2n 

2n\ f 2k f t 



CD (I 



+ (0 dt 



^[jt]Gs{,^ + t) dt+ 



2k (2k 



(g, + - (O) 



E 

2<m<2n 



f2n\ r 2k ft 
\m) J T U 



i^[jt]Gs {^ + t)dt+ 



+ jj^ {jt) {G4^ + t)-GA^))dt. 



(3.42) 



1 1 



< 



f2n 



2<m<2n 



m 



2k ft 



i'[jt]Gs{^ + t) dtd^+ 



2k /2k 



G, (^ + t)-G, 



dtd^ + 



1 



2n- lE2»-i' 



We will consider the last integral in 3.42 first. Since 

Gs{^ + t)-GsiO = e^^.iO-GliO 



((e^M-l) G^YiO 
eiit,-) _ 1 



<|<| ||a;G,(x)||,, 



A 

G.) (C) 



it follows that 



2k 



Gl{^ + t)-Gs {0\dt<\\xGs {x)\ujj^^\t\^(^t]dt 



2_k 
I 



^\\xG, {x)\\,\\si^{s)\\,. 



which exists since ip ^ and {s) ds < oo. Thus 



\i\<R 



2k , /2A: 



Gs{^ + t)-GAO\dtd^< I ^-^\\xGs{x)\U\\si,{s)\\,dC 



l«l<« 

= ^||a;G, ix)\\,\\si^is)\\,. 



(3.43) 
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Regarding the integrals under the summation sign of 3.42, for m = 2, 3, . . . , 2n we make the change of 
variable s = to get 



2k ft. 

T U 



2k 



< 



2k , /2fc , , 



R 



and similarly 



2fc n 



< 



\s"^^is)\\,. 



R 



(2fc)' 
R 

{2ky 



so that 



< 



2R 
2R 



s\ ip{s)ds 

■j \sr^p{s)ds 



Inequalities 3.43 and 3.44 can now be used to estimate the right side of 3.42: if i? > 2 



1 1 



< 



2<m<2n 



2n\ 2R 



R^ 



{2kr "'"^^'^iii + 2k i'"'^^ ^''^"i 



(3.44) 



+ 
2k 



2n-l 

E 



I 2<m<2n 



= g (22« - 1 + llxG. (x)llO r|Sx{||/'V WIIJ + 

and it is clear that given e > the right side can bc^ made less than e by first choosing R such that 
fjit-i < § and then choosing k so that the remaining terms also do not exceed e/2. Thus we have shown 
that — > — in and hence in S'. 



3.5 Local data functions 

In this section we will use the generalized local data function results of Theorem 91 and Corollary 92 
of Chapter 2 to characterize locally the data functions for the tensor product central difference basis 
functions as Sobolev-likc spaces. These results supply important information about the data functions 
and makes it easy to choose data functions for numerical experiments concerning the zero order basis 
function interpolation and smoothing problems discussed later in Chapters 2, 4 and 5. 



3. 5. 1 Tensor product weight functions 

Here we are interested in tensor product weight functions e.g. the multivariate; central difference weight 
functions and the extended B-splines 1.20. We want a result which allows us to prove that there exists 
integral n such that the local Sobolev-like space 



ff," {n) = {w e (fi) : D'^u e (n) for a < n} , 
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of Definition 81 is a subset of the restriction space X° (recall Definition 86). Further we want this 
result to (essentially) involve only proving estimates about the univariate weight function. This 
result is Theorem 123 and will allow us to justify using il" (O) functions as data functions in numerical 
experiments. 

Further, if wi satisfies the assumptions of Corollary 124 then (f2) = (f2) as sets and their norms 
are equivalent. 

Theorem 123 Let w be a tensor product weight function on R'^ generated by the univariate weight 

function wi. Suppose: 

1. n C M'' is an open, bounded set which permits one of the extension operators £q : (fi) if" 
described in parts 5 to 7 of Remark 82. 

2. There exists a bounded function gi -.M} ^ and constants cj^^ > such that 

and if we define g -.W^ —^ by g (^) = gi (^i) x . . . x gi (^d) then 

supp{g{D)u-u)c([-l,lfy , ueHS^i-l,!)'), (3.46) 

where g {D) u = {gu)^ . 

3. There exists an open cube C (with center c) containing fl and an integer m > for which 

wi (t/m) < c^wi (t), te K\ (3.47) 

for some constant c^ > and (C — c) /m C [—1, 1]"^. 

Then (il) ^ X'^^ (fi) and this inclusion is also valid for any scaled weight function with appropri- 
ately scaled ft. 

Proof. The proof involvc's showing the assumptions required by Theorem 91 are met. Firstly, observe 
that assumption 1 of this theorem is assumption 1 of Theorem 91. Next define the bounded function 

g:Rd^C^hy g{0= U 9 (Cfc)- Then if C7„ = ci'^ci'J . . . dj] , 
fe=i 

k—l k—1 \ak<n j a<n a<n 

which confirms 2.75 and as a consequence 3.46 implies that assumption 2 is also satisfied. 
Finally, if Cm = (cm^^ and inequality 3.47 is valid then 

d d 

w {£,/m) = Wwi i^k/m) < Y[ c^'wi (^fe) = CmW (0 . 

k=l k=l 

which means that assumption 3 of Theorem 91 is also satisfied. The conclusion of this theorem now 
follows from that of Theorem 91. ■ 

This corollary supplies a condition under which X° (f2) = iJ" (f2) as sets with equivalent norms. 

Corollary 124 Suppose the assumptions of Theorem 123 hold. Also suppose f2 C Rn where Rq is an 
open rectangle and assume that there exist constants Ck > such that the univariate weight function wi 
satisfies 

d 

wi {t) > Ckt"^, t&yji^j {Ra) , fc = 0, 1, 2, . . . , n, (3.48) 
j=i 

where nj (x) = Xj for x = (xj) E M.'^ . 

Then (il) = (O) as sets and their norms are equivalent. This result is also valid for any scaled 
weight function with appropriately scaled fl. 
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Proof. Suppose ^ e O and a < n where a is a multi-index. Set Ca = H • Then ^ e Rq and 



m—l i=l 



and this corollary now follows from Corollary 92. ■ 
Examples 

We present two applications of the theory of data functions introduced in the last subsection to tensor 
product central difference weight functions. 

In the first example the strong integral condition 3.11 is imposed on the weight function because this 
condition is associated with simple upper and lower bounds on w which imply (O) = if" (O) as sets. 

Theorem 125 Suppose w is a tensor product central difference weight function on R'^ with parameters 

n,l,q which satisfies the condition 3.11 i.e. J^i^^^^ f^^q{t) dt < oo. Further suppose that fl is an open, 
bounded set which permits one of the extension operators £q : H" [fl) described in parts 5 to 7 of 

Remark 82. 

Then (O) = if" (O) as sets and the norms are equivalent. 

Proof. Denote by Wi the univariate weight function. Prom Corollary 109, given any a > there exist 
constants Ca,c'^^,ka,k'^^ > such that 

kat'^"- < wi (t) < k'j'^"-, \t\ > a, (3.49) 

and 

^<w,{t)<^, \t\<a. (3.50) 

Wc will verify that 3.49 and 3.50 imply that the conditions of Theorem 123 and Corollary 124 hold. 
Noting Remark 94 above, if wc choose g (^) = (sin^)' " then 3.46 is satisfied. In fact, if wc (arbitrarily) 

choose a = 1 then: |t| < 1 imphes \gi {t)\ wi (t) < "a^-r.) — c[ < c'^and \t\ > 1 impHes \gi {t)\ wi (t) < 

(sint)^^'"") fc'ii^" < k[t^''\ so that 3.45 holds. 

Now we show that 3.47 holds for all integer m > 1: if \t\ > 1 then from 3.49 Wi it/m) < -^t^^ 

and 1/wi {t) < so that ^^^^^ < if 1*1 < 1 then from 3.50, Wi {t/m) < "'^^'r-n/^ and 

l/w^i {t) < and thus '"ij^^'' < ""'^'f'''' - ^^^^^ ^•^'^ ^^^'^^ integer m > 1. 

The next step is to show that 3.48 holds: here we choose 

{d 1 d 

1*1 : f e U TTj (Rq) > so that if t € (J TTj (Rn) and a < n, from 3.50 
j=i J j=i 

/,\ ^ C<j Ca 1 Ca Ca t\ Co 2a 

^1 W - ^2((-n) - a^il-n) (•^/Q^2(/-n) - a2(J-n) - a2(Z-n) J " g^2(a+l-n) ^ ' 

as required. ■ 

The last result assumed that /|t|>^ t^'^q (t) dt < oo. We now weaken this assumption to 

I\t\>R < ^iid see what extra assumption is required to ensure that X^ (O) = Hi (O). 

Theorem 126 Suppose w is a central difference weight function on with parameters n, I, q which 
satisfies the condition 3.7 i.e. J^^^^j^t'^"'q{t) dt < oo. Further suppose that O is an open, bounded set 
which permits one of the extension operators £a : if" (fl) if" described in parts 5 to 7 of Remark 82. 
Suppose, in addition, there exist constants r,ki > such that 

^(0>^, \^\<r. (3.51) 

Then (il) — H'^ (il) as sets and the norms are equivalent. 
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Proof. We first verify that 3.49 and 3.50 imply that the conditions of Theorem 91 and CoroUary 92 hold 
for the case of one dimension. Prom 3.9 there exists a constant Ci > such that 

10(0 eeR\ (3.52) 

and from 3.17 there exist constants ci, C2 > and < a < 6 such that 

»(0<{i52: (3^53) 

Using these two estimates and the additional assumption 3.51 we can derive inequalities of the form 
3.49 and 3.50 and so prove this theorem in a similar manner to Theorem 125. ■ 



3.6 Interpolant convergence to its data function 

We will use the results of 2 to derive orders for the pointwisc converge of the minimal norm interpolant 
^xf to its data function /. These estimates will have the form 

1/ [x) - {Ixf) {x)\ < kc ||/L,o ihx,ar , x&Ti, 

where hx,Q. = sup dist (w, X) < hn,K measures the density (sparsity?) of the independent data X 

wen 

which is contained in the data region Q. Here m is the order of convergence. 

The results of Chapter 2 can be classified according to whether they require the independent data X 
to be unisolvent and a full discussion of unisolvency is incorporated into Subsection 2.5.3. 

It turns out that all the convergence orders obtained here for the central difference weight functions 
are identical to those obtained for the extended B-splines. 



3.6.1 Convergence estimates derived without assuming unisolvent data sets 

Following the approach of Chapter 2 we consider two types of convergence results for the case when the 
data is not assumed to be unisolvent. These are imaginatively called Type 1 and Type 2. 

Type 1 convergence estimates 

Recall that Type 1 convergence estimates are those obtained using Theorem 59, where there are no a 
priori assumptions made about the weight function parameter k but the smoothness condition 3.54 is 
applied to the basis function near the origin and this will allow a uniform order of convergence estimate 
to be obtained for the interpolant in a closed, bounded, infinite data region. 

Theorem 127 ( Copy of Theorem 59) Suppose the weight function w has property W2 and that G is the 
basis function generated by w. Assume that for some s > and constants Co, ho > the basis function 
satisfies 

G{0)-ReG{x)<CG\x\'^% \x\ < ho- (3.54) 
Let Xx be the minimal norm interpolant mapping with the independent data set X contained in the 

_d 

closed, bounded, infinite set K , and let ka = (27r) * \/2Gg- Then for any data function f G it 
follows that yjif -Ixf, /)^,o ^ ll/IL,o 

1/ (x) - Ixf {x)\ < kcsjif - Ixf, /)^,o ihx,Kr , xGK, (3.55) 
when hx,K = max dist {x,X) < ha i.e. the order of convergence is at least s. 

In the next result we derive constants s, Cq and ha for which the central difference basis function 
satisfies equation 3.54. 
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Theorem 128 Suppose the weight function w is a tensor product central difference weight function with 
parameters n, I, q{-), and q is bounded if n = 1. Let G be the basis function generated by w. 
Then the basis function satisfies the estimate 3.54 for s = 1/2, ho = oo, 

Cg = Gi {0f-^\\DGi\\^Vd, 

and 

ka = {2n)-i ^/2C^ = (277)"^ ^2Gi (O)'^"' ||DGi||^ ^. 
Proof. Let Gi be the univariate basis function. By Theorem 120, Gi G Gg\ DGi is bounded and 
\G{x)-G{y)\< y/dG^ {of'' \\DG^ ||^ \x-y\, x,y e 
so that we can choose Cg = Gi (0)'^"^ \\DGi\\^ Vd and 

ka = {27r)-i ^2^ = (277)" ' ^2Gi (0)'^-' ||DGi||^ ^. 



Type 2 convergence estimates 

The Type 1 estimates of Theorem 127 considered the case where the weight function had property W2 
for some «; > and an extra condition was apphed to the basis function. Type 2 convergence estimates 
only assume k > 1. In the next theorem we only assume that k > 1 and derive an order of convergence 
estimate of 1, as well as a doubled convergence estimate of 2 for the Riesz data functions Ry. Note that 
the constant ko in the next Theorem is defined to match the constant kc for the Type 1 convergence 
estimate 3.55. 

Theorem 129 (Copy of Theorem,65) Suppose the weight function w has property W2 for a parameter 
K> I. Suppose also that Ixf is the minimal interpolant on X of the data function f G and that K 
is a hounded, closed, infinite subset ofW^, X c K and hx,K = maxdist {x,X). 

Then 

1/ (x) - (Ixf) {x)\ < kG^{f-Ixf,f)^^ohx,K, x€K, 

where 

k% = {2^)-" / ^ = - (27r)-^ (V^G) (0) , ^(/-Jx/,/L,o < ll/IL,o . 
and the order of convergence is at least 1. Further, for the 'Riesz' data functions Ry, y G K, we have 

\Ry {x) - {IxRy) {x) I < ka {hx,Kf , x,yGK, 
i.e. a 'doubled' estimate of at least 2 for the rate of convergence. 

Since the weight function has property W2 for «; iff k + 1/2 < n we can choose k = 1 iff n > 2. Then 
by Theorem 129, ka = {27:)"^'^ (V^G) (0) and since 

d d 
(V^G) (0) = ^ {DIG) (0) = Y.Dl (Gi (xi) G, {x^) ...G, {xa)) (0) 
fe=i fe=i 

d 

= ^Gi {of-^D'^Gi (0) 
fc=i 

= Gi (0)'*-^ D'^Gi (0) d, 

we get the formula 

ka = (27r)-'^/^ ^-Gi (0)'*-^ D'^Gx (0)\/d. (3.56) 
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3.6.2 Convergence estimates assuming unisolvent data sets when k, > 1 

In fact comparison of Theorem 7, Chapter 1 for the extended B-spUnes with Theorem 98 of this document 
shows that for parameters n and I the smoothness parameters k can take an identical sets of values when 

/ri 1^1^"^^ 1 iO dS, < oo, since q e . 

Wc now require the following convergence estimate from Chapter 2: 

Theorem 130 ( Copy of Theorem 74) Let w he a weight function with property W2 for integer parameter 
K>1 and let G he the corresponding basis function. Set m = [k\. 

Suppose Txf is the minimal norm interpolant of the data function f G on the independent data 
set X contained in the data region Q,. We use the notation and assumptions of Lemma 12 of Chapter 2 
which means assuming that X is m-unisolvent and fl is a hounded, open, connected set whose houndary 
satisfies the cone condition. 

Now set ka = ,„ \Ji2 — r (ca k)™ -K'a « max \D'^I^G (0) I . Then there exists /in « > such that 

\f{x)-{Ixf){x)\<kG\\f\\^^^{hx.nr. xen, (3.57) 
when hx,n — sup dist (oj, X) < Hq^k i-e- the order of convergence is at least m. 

The constants cn,m> LCq ^ and ha^m only depend on fl, m and d. In terms of the integrals which define 
weight property W2 we have 

max|Z?^/'G(0)|<(2.)-^/M!j^. 

We now show that the order of convergence of the minimal norm interpolant to an arbitrary data 
function is at least n — 1: 

Corollary 131 Suppose the weight function is a central difference weight function with parameters 2 < 
n < I. Then the order of convergence of the minimal norm interpolant to an arbitrary data function is 

at least n — I. 

Proof. Theorem 98 implies that m = [kJ = n—1 and Theorem 130 implies that the order of convergence 
is at least n—1. ■ 



3.6.3 Summary table 



Interpolant convergence to data function - central difference weight function. 


Estimate 
name 


Unisolv. 
data? 


Parameter 
constraints 


Converg. 
order 


{2^f'kG 


ha 


Type 1 


no 


q b'nded if n = 1 
else n>2 


1/2 


^2Gi {Qf-'WDG.W^rd 


oo 


Type 2 


no 


n > 2 


1 


^-Gi {of-' D^Gi {0)Vd 


oo 


Unisolvent 


yes 


n > 2 


n-1 




oo 



TABLE 3.1. 
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The Exact smoother 



4.1 Introduction 

We call this well-known basis function smoother the Exact smoother because the smoother studied in 
the next chapter approximates it. We will assume the basis function is real-valued. The Exact smoother 
minimizes the functional 

i=l 

over the data (function) space X° where p > is termed the smoothing coefficient. It is shown that 
this problem, like the interpolation problem, has a unique basis function solution in the space Wg,x- 
The finite dimensionality of the solution allows us to derive matrix equations for the coefficients a, of 
the data-translated basis functions. 

Three estimates are derived for the order of the pointwise error of the Exact smoother w.r.t. its data 
function: two estimates do not assume unisolvcnt X data and one docs. When p = these estimates 
correspond to interpolation results of Subsection 2.5.2 and the Exact smoother convergence orders and 
the constants are the same as those for the interpolation case which are given in the interpolation tables 
1, 2, 3 and 4. 



Non-unisolvent data: Type 1 error estimates When the weight function has property W2 for «; > 
it is assumed that the basis function satisfies an inequality of the form 1 and that the data region is a 
closed, bounded, infinite set K. In this case it is shown in Corollary 153 that the Exact smoother Se of 
the data function / satisfies the error estimate 

xG K, 



1/ {x) - Se {x)\ < + ka {hx,Kr) , 



when hx K = maxdist {x, X) < ho and kc = (27r) '^^^ \/1Cg- 



Non-unisolvent data: Type 2 error estimates If it only assumed that k>\ then it is shown in The- 



orem 156 that 

\f{x)-S,{x)\ < ||/IL,o(v^+fcG/lX.A-) , Xei^, 

when hx,K < oo and ko = (27r)"''/'' (W^G) {Q)Vd. 



Unisolvent data error estimates On the other hand, suppose the weight function has property W2 



for some parameter k > 1, the independent data X contains a unisolvent set of order [kJ and X is 
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contained in an open, bounded data region il. Then using results from the Lagrange theory of polynomial 
interpolation we show in Theorem 160 that there exist constants K'q ^,kG > such that 

\f{x)- {x)\ < ll/IL^o (K'^^^^ + kG ihx,n)^''^) , X G n, 
when hx,n = sup dist (a;, X) < ha- 

These theoretical results will be illustrated using the weight function examples from the interpolation 
chapter, namely the radial shifted thin-plate splines, Gaussian and Sobolev splines and the tensor product 
extended B-splines. We will also use the central difference weight functions from Chapter 3. 

Numerical results are only presented for the non-unisolvent data cases. Numeric experi- 
ments are carried out using the same 1-dimensional B-splines and data functions that were used for the 
interpolants. We restrict ourselves to one dimension so that the data density parameters hx,Q. and hx,K 
can be easily calculated. Also note that in one dimension part 2 of Theorem 214 implies that any set of 
m distinct points is m-unisolvent. 

No numerical experiments will be carried out for the unisolvent data case because I did not want to nu- 
merically estimate the parameters related to the Lagrange polynomial interpolation. However satisfactory 
easy estimates may be possible. 



4.2 The Exact Smoothing problem 

The Exact smoothing problem will be defined as a variational minimization problem within the space of 
continuous functions • The functional is constructed using a real smoothing parameter /? > and a set 

of distinct scattered data points {{x^'^\yk)]^_^, -i''*'^ G M'' and yk € C. The functional to be minimized 
is 

1^1 2 

^e[/]=p||/C + ^E|/(^''^)-2/'^ ' /e^- (4-1) 
fe=l 

The independent data will be specified hy X = (x^^^)^_^ and the dependent data by y = {yk)^=i- 
The Exact smoothing problem is now stated as: 



The Exact smoothing problem 

Minimize the smoothing functional Je[f] for f £ X[ 



Here the first component of the functional p ||/||^ q global smoothing component and the second 

component is the localizing least squares component. As p — > 0"*" it will be shown that the matrix equation 
representing this problem becomes that of the minimal norm interpolation problem 2.4. As the value of 
p increases numerical experiments show the degree of smoothing increases and the correlation between 
data and the smoothing function decreases. 

Using Hilbert space techniques I will now do the following: 

1. Show there exists a unique Exact smoother. 

2. Show the smoother is a basis function smoother i.e. show that it lies in Wg,x- 

3. Construct a matrix equation for the coefficients of the data-translated basis functions G [■ — x^''^). 

The proofs will be carried out within a Hilbert space framework by formulating the smoothing func- 
tional Je in terms of a norm. on the Hilbert product space V = X° ig) C''^. To this end I first 
introduce the following definitions. 

Definition 132 The space V and the operator Cx 

1. Let V = X^ ® be the Hilbert product space with norm and inner product (•, •)y given by 

{{U1,U2) ,{vi,V2))y = p {ui,Vi)^Q + ^ {U2,V2)cN ■ 



(4.2) 



2. Let the operator C 



X 



V 



4.2 The Exact Smoothing problem 109 

be defined by Cxf = {j^^xf^ where Sx is the vector-valued 



evaluation operator Ex f = {f i^^''^)) of Definition 51. 

Remark 133 The definitions above were constructed so I have the following result: If y = {jji) is the 
(complex) dependent data given in the smoothing problem and if I let <, = (0,2/) then for f e the 
Exact smoothing functional can he rewritten in terms of the norm \\-\\y as 



Mf] = p\\f \\l,o + j^ 



£xf - y 



{f,£xf 



[f,£xf)-{Q,y)f^ = \\Cxf-<; 



The Exact sm,oother is now the (unique) orthogonal projection of the constant (0,y) onto the infinite 

dimensional subspace Cx {^w)- 

I will need the following properties of the operators Cx, C\ and C,\C,x- 
Theorem 134 The operator Cx has the following properties: 

1. Il^jc/lly and 11/11^ Q are equivalent norms on X'^. 

2. Cx '■ V is continuous, 1-1 and has closed range. 

3. Cx ■ V^X^ and ifu = {U1U2) G V then 

C*xU = pui + J^SxU2- 
I C*xCx : ^° ^ ^° and C\Cx ■ Wg,x ^ Wq^x and 

C*xCx = pl+ ^£*x£x- 



5. C*xCx is continuous 



1-1 on X, 







Proof. Part 1. By part 3 of Theorem 52, 



so 



2 2 1 

^ = p||/IL,o + ]v 



£xf 



< 



\l,o + ^\\Rx,xf\\f\\lo 

-j^mx,xf) 11/11^,0- 



Further, since p > 



Wlo < i 



mm < /3, 



p\\f\\l,o ^ 



N 



£xf 



, = mm < p,— 



l^xfW 



Part 2. Cx ■ X'^ ^ ^ is continuous since £x is continuous by part 1 of Theorem 52. From part 1 
there exists a constant Ci such that ||/||^ — ^1 ll-^^/lly ^^'^ hence the range of Cx is closed and Cx 
is 1-1. 

Part 3. Since Cx is a continuous operator the adjoint Cx '■ V — > X^ exists and is continuous. Further, 

if f e X° and u = {U1U2) e V 

{Cxf,u)y = ,(U1U2))^ = P {f,Ul)yj,o + {^xf,U2j 



p{f,U,)^,,+ J^[f,SxU2^ 



wfi 



= ( /, PUl + J^£*xU2 
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Hence C*xU = pui + -^£^M2, where u = {U1U2) € V . 
Part 4. If / e XO then 

z:^/:;,/ = p{Cxf\ + (/:x/)2 = p (f,£xf)^ + j^£*x (f^^xf)^ = pf + j^£x£xf. 

By part 2 of Theorem 52 f*. : ^ Wg,x and it follows that jC*xjCx : X° ^ and jC^^jCx : 
VTcx Wg,x- 

Part 5. Clearly Cx^x is continuous. Now suppose ll\C,xf = for / G X° . Then = {C*xCxf, f )^ g = 

{Cxf,Cxf )yj = ll'^x/lly so that Cxf = and hence / = since £jc is 1-1 by part 2. Therefore C*xCx 
is 1-1. ■ 

I next show that the Exact smoother, like the minimum norm interpolant of Chapter 2, has the nice 
property of being in the N-dimensional basis function space 



Wg,x = aiG (• - : e c| , 



introduced in Definition 46. It was shown in Theorem 45 that the data-translated functions G (• — a;^'^) 
are linearly independent so dim Wg,x = ^• 

Theorem 135 Fix y G and let c = (0, y) G V. Then there exists a unique solution to the Exact 
smoothing problem 4-2 in X^. Denote this solution by Sg- This solution has the following properties: 

1. \\CxSe - ^\\y < \\Cxf - ^\\y, for all / e X « - {s,}. 

2. {CxSe-<;,jCx{se- f))v = 0> for all f € X^. 

3. \\CxSe - <;fv + W^x {se ' f)fv = W^xf - for all f&Xl. 
The last equality is equivalent to the equality of part 2. 

4. Se = jf{C*x^xr'£*xy- 

5. Se € Wg,x and 

1 ^ 

'- = -^lliy''- 'e{x^'^)) ^.(^) ■ (4.3) 

Proof. Parts 1, 2, 3. Because CxiX*;^) is closed it is well known there exists a unique element Sg of 
the hyperplane £x{X^) which minimizes the distance between the hyperplane £x{X^) and the point 
(J. Prom simple geometric considerations I follows that Sg satisfies conditions 1, 2 and 3 of this theorem. 

Part 4. Prom part 2 

= {CxSe - cXxSe -Cxfjy = i^XSe - ',,Cx {Se - = {C*x {CxSe - ^) , Se " /)„,o ' 

for all f €X°. Thus 

{CxSe -0 = 0. (4.4) 
Hence ^ 

L*xC,xSe = C*x<i = C*x (0, y) = j^^xV, 
and since by part 5 of Theorem 134 jC^jCx is one-to-one, it follows that jC-x^Cx has an inverse and so 

Se = {C*xCxT^£*xy- 

Pcirt 5. Starting with equation 4.4 

Q = Cx {CxSe - ?) = C*xCxSe - J^^xV = PSe + ^£*x£xSe - J^^xV 



= PSe + -j:^£*X (SxSe - y) 
1 ^ 

= PSe - X] {^e{x'^''^) - yk) Rx'.l') > 
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which proves 4.3. Finally, from the introduction to this paper R^(k) = 
(27r)"'*/^ G (• - x^*^)) and so I have Se e Wg,x- ■ 

In a manner analogous to the case of the minimal norm intcrpolant, part 4 of the last theorem allows 
the definition of a mapping between a data function and it's corresponding Exact smoother. 

Definition 136 Data functions and the Exact smoother operator : Wq.x- 

Given an independent data set X, I shall assume that each member of X^ can act as a legitimate data 

function g and generate the data vector Ex g - see item 4 of Section 4-8. 

The equation of part 4 of Theorem 135 enables us to define the continuous linear mapping Sx '■ X^ — » 

Wg,x from the data functions to the corresponding Exact smoother given by 

SScg = {jC*xjCx)-' rx£xg, gexl (4.5) 

It was shown in Theorem 134 that C^Cx '■ X^ X^ is continuous and 1-1. 1 now prove CX^Cx is onto 
and hence a homeomorphism. I also prove some important properties of the Exact smoother mapping 

ce 

'->x- 

Corollary 137 jC^jCx ■ X^ — > X^ is a homeomorphism and jC^jCx ■ Wg,x — > Wg,x is a homeomor- 
phism. 

Regarding the properties of Sx •' 

1. S^x9 = 9-p{^*x^xr^g, g&xl, 

2. Sx is continuous but not a projection, 

3. S\:Xl^ Wg,x is onto, 

4. null^i = W^^x, 

5. Sx is self-adjoint. 

Proof. Suppose g G X'^ and let Sg = 5^.9- From part 5 of Theorem 134 I know that C*xCx is 1-1. From 
part 4 of Theorem 135 C*xCxSe — j^Sx£xg and from part 4 of Theorem 134, C*xCxg = pg + j^^x^xg 
so that C*xCxg = pg + C*x^xSe. Clearly this equation implies part 1 and the equation 

g = CxCx{^-^\, g&Xl (4.6) 



This latter equation proves g G range Thus H^C-x ■ X^ — > X° is onto and by the Open 

Mapping theorem L*xLx is a homeomorphism. If g G Wg.x then 4.6 implies C*xCx ■ Wg.x Wg.x is 
onto and so C*xCx ■ Wg,x Wg,x is also a homeomorphism. This fact and part 1 clearly imply that 
Sx is not a projection. 

Further, in Theorem 52 it was shown that Sx^-x '■ X^ — » Wg,x is onto and nullSx^x = Wqx- The 
stated properties of Sx now follow from 4.5 and the fact that Sx^x and C*xCx are self-adjoint. ■ 



4.3 Matrix equations for the Exact smoother 

Part 5 of Theorem 135 means that the Exact smoother lies in the finite dimensional space Wg.x and 
this enables matrix equations to be derived for the coefficients of the data-translated basis functions 
G (x — x^*^') . These matrix equations can be written in terms of the basis function matrix Gx,x 

)) or the reproducing kernel matrix Rx,x = {Rx^i'i i^^^^)) where Rx,x = (27r) ^ Gx,x. 
The relevant properties of Gx,x and Rx,x were noted in the Introduction to this paper. Ry (x) ^ 
G{x — y) because the Fourier transform I have selected is /(^) = (27r)~''^^ J e~"^/(x)dx. For numeric 
algorithms the use of Gx,x might be preferred because G is specified directly and for analysis the use 
of Rx,x may be preferred because of the analytic properties of Rx- 
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Theorem 138 Fix y G and let <; = (0, y) G V. Then by Theorem 135 there exists a unique solution 
to the Exact smoothing problem. I denote this solution by s and note that here, and in the sequel, I will 
sometimes use the more compact notation sx = £xs. The solution s has the following properties: 

1. sx satisfies the matrix equation 

(Npl + Rx,x) sx = Rx,xy, (4.7) 

_ d 

where Rx,x = (27r) ^ Gx,x- 

2. The matrix N pi + Rx,x is real, symmetric, positive definite over and regular. 

N 

3. s{x) = X) ttkRxW {x) where a = (afc) satisfies 

k=l 

{Npl + Rx,x) a = y, (4.8) 

and Oi= ^{y — sx) when p> 0. 

The next part is convenient for numerical purposes. 

N 

4. s{x) = ^ PkG [x — x^''^) where (3 = (/3fc) satisfies 

fe=i 



({2tt)-- Npl + Gx,x) 13 = y. 



Proof. Part 1 Part 4 of Theorem 135 and part 4 of Theorem 134 imply 

C^Cxs = pis + ^£x£xs = ^r^y. (4.9) 

From Theorem 52 I have £x£-*x = Rx,x so that applying the operator (27r)'^^^ N£x to 4.9 gives 4.7. 

Part 2 From the Introduction Rx.x = {2ti)^^ Gx,x is Hermitian and positive definite over C^. Since 
this paper assumes the basis function G is real valued, Rx,x must be symmetric and thus N pi + Rx,x 
is symmetric. Now suppose a G . Then 

{Npl + Rx.x) a = Np \af + Rx,xoi, 

and since Rx,x is positive definite over C^, {Npl + Rx,x)o. = implies a = 0. Thus Npl + Rx,x 
is positive definite over and consequently must be regular. 

Part 3 From part 5 of Theorem 135, s{x) = ^ a^i? [x — x^''^) where Qfe = ^''"^^ — - i.e. a = (afe) = 

fe=i 

■ Thus Sx = Rx,xO! and substituting for sx in 4.7 we get 
{Npl + Rx.x) Rx.xa = Rx.xy ^-c matrix equation 4.8 since Rx,x is regular. 

Part 4 A straightforward consequence of part 3. ■ 
The next corollary will prove very useful. 

Corollary 139 If g G is a (complex) data function then the Exact smoother operator Sx satisfies 

S'x9 = £*x{NpI + Rx,x)~'£xg, (4.10) 

and 

{S^x9) {x) = (£xR.y {Npl + Rx,x)~^ Sxg. (4.11) 
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Proof. The data vector is £x9 and from part 3 of the last theorem (S^g) {x) = Yl Q!/si?a;(fc) (a;) where 

_ fe=i _ _ 

a = (ak) is given hy a = (N pi + Rx,x)~^ £xg- Now making use of the results concerning £x and 

in parts 1 and 2 of Theorem 52 we have 

N 

s = S^g = akRx(k) = £*xQ. = £*x {Npl + Rx,x)~^ £xg, 

k=l 

and so, since the basis function is assumed to be real valued, 

SScg {x) = fea, R^ = (a, £xRx) „ = {{Npl + Rx,x)~^ £xg, £xRx) „ 

= (£xgY {Npl + Rx,x)~^ £xR. 
= {ixRxf {Npl + Rx,xr^ £xg. 



4.4 More properties of the Exact smoother 

Theorem 140 By Theorem 135 there exists a unique solution s to the Exact smoothing problem. This 

solution has the folloiDing properties: 

1. For all f €X° 



Vk 



fe=l 



A:=l 



Vk 



fe=l 



(4.12) 



2. Iff is the data function then \\s\\^q < ||/|L^o "-^^ ||s- /IL,o ^ \\J \\w,o- 



N 



3. \\s\ 



wfi pN 



i^ReEs(xW)(yfe-s(x«)). 



fe=i 



JV 



I Je[s] = ^ReE {yk-s{x(^^))W- 

k=l 

Proof. Part 1. From part 3 of Theorem 135 

\\Cxs + \\Cx (s - f)\\l = \\Cxf - <^\\l , 

where <j = (0, y). Part 1 follows from Remark 133 and the definition of the norm on the space V. 
Part 2. If / is the data function then / {x^''^) = yk and part 1 implies part 2. 
Part 3. Substituting / = in equation 4.12 of part 1 yields 

-I W N 2 1 " 2 



fe=l 

JV 



k=l 



k=l 

2 



= ^EM-hM-^^r-hM 
fc=i ^ 

. JV 

= ^ E (-2^ (^^'^)« (^^'0 + 2 Re (s {xi''))yk)) 
fc=i 

„ AT 



114 



4. The Exact smoother 



N 



Part 4. By definition, [s] = p ||s||; ^ + ^ E |s {x^''^) - vA ■ 



fe=i 



Substituting for ||s||^ q using the result of part 2 I get 

fc=l 

-I N 

= 1 Re^ . (xC^)) - . (.C^))) + 1 ^ |. (xC^)) 
fe=i fe=i 

^ N 



N 



Vk 



fe=l 



-i: Re ^ (yfe - s (xC'^) 



fe=i 

AT 



-iRe^(2/fe-s(, 



fe=i 



Remark 141 Part 1 of the last theorem confirms that if s is the Exact smoother then Jg [s] < Je [/] for 
all f € X° i.e. that s minimizes the functional Jg. 



4.5 Semi- inner products and pointwise error estimates 

Following the example of the interpolation seminorm (/ — Xxf^ g)^ q Definition 62 of 2 and it's sequel 
I will define the two semi-inner products (5^/, g)^^ and (/ — 5^/, .g)^^ and use these to study the 
value and the error of the Exact smoother respectively. Of course these semi-inner products are useful 
because {S\f, Rx),^^o = i^xf) i^) and (/ - S^f, Rx)^^o = if - S^f ) {x) and because of the power of 
Hilbert space theory An outcome of this approach is the Exact smoother bound 4.15. 
To define these smoother scminorms I will need the lemma: 

Lemma 142 Suppose f G X° , is the Exact smoother mapping and X = {xW}^^^. Then: 

1- \\S3cf\\l,o = i^xf, fL,o - PN \{pNI + Rx,xr' Sxff. 

2- 11/ - 'Si/C^o = (/ - ^xf, fL,o - pN\ipNI + Rx,x)-' £xff. 
Proof. Part 1. I start with formula 4.10 for S^f- 

\\Sxf\\l,o = (rx{pNI + Rx,xr'£xf,S*x{pNI + Rx,xr'Sxf) 

= (£x£*x (PNI + Rx.xT^ £xf, (pNI + Rx,xr' £x.f] 
= (Rx,x {pNI + Rx,xy^ £xf, {pNI + Rx,x)~^ £xf) 

since £x£x = Rx,x but 

Rx,x {pNI + Rx,x)~^ = {pNI + Rx,x) {pNI + Rx,x)~^ - pN [pNI + Rx,x)~^ 
= I - pN {pNI + Rx,xr\ 
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so that 



W^xfWlfi = {^xf, {pNI + Rx,xr' £xf) - pN {{pNI + Rx,xV £xf, {pNI + Rx^xT^ £xf) 
= {pNI + Rx,x)~^ £xf, f) - pN {pNI + Rx,xr^ £xf ' 



= i^xf, /)^,o - ipNI + Rx,x)-' £xf 



Part 2. Substituting the equation for ||<S^/||^ q Pi"oved in part 1 yields 

11/ - S'xfWlfi = ll/llLo - 2 + WS'xfWl,, 

= 11/11^,0 - 2 (5^/, /).,o + {S'xf, /)^,o - pN {pNI + Rx,x)-' £xf 



\\w,0 



i^xfJho-P^ {pNI + Rx,xr'£xf 



if - S3cf, /)^,o - pN {pNI + Rx,xr' £xf 



Theorem 143 The expressions {Sxf'9)w0 "'^^ if ~ ^xf'9)w o "'"^ ^^^^ semi-inner products on X° 

with null spaces Wq-^. 

Proof. From part 1 of Lemma 142, {S'^f, f}^ „ > and {S%,f, f)^ ^ = 

iff {pNI + i?x x)"^ ^jf / = iff fx/ iff / e VF^^. Part 2 of Lemma 142 can similarly be applied to 

(/-5i/,9L,o-'" 

I can now define our Approximate smoother seminorms on X^. 

Definition 144 Approximate smoother value seminorm and Approximate smoother error 

seminorm on X'^. 

Define the Approximate smoother value seminorm and semi-inner product by 

l/lL = i'Sxf, /)«,,o ' (/' 9)sv = i^xf, 9)wfi ■ 

The qualifi,er 'value' is used because {f,Rx)gy = {Sxf) (■^) • 
Denote the Approximate smoother error seminorm and semi-inner product by 

l/lL = (/ - S%f, fhfi > (/, 9)se = (/ - ^xf, 9)^,0 ■ 

The qualifier 'error' is used because {f,Rx)se = if ~ ^xf) (^)- 

In the next theorem the pointwise error problem for an arbitrary function is reduced to that of 
considering the error of the smoother of Rx at x, where Rx is the Ricsz representer of the evaluation 
functional f ^ f (x). This error will latter be estimated using the values of the basis function near the 
origin. This theorem also gives an error bound that is uniform on W^. 

Tiieorem 145 Since Sj^fd is the Exact smoother of the data function fd S X^ and Sj^Rx is the Exact 
smoother of the Riesz representer data function Rx, we have the bound 



\fd {x) - S$,fd {x)\ < ^{fd - S-^fd, fd)^^o^Rx (x) - {SI,Rx) (x), xGR", (4.13) 



where ^ {fd - S'^fd, /d)^,o - ll/dlL.O' "-^ ^^^^ ^^^^ bound 

\fd {x) - S^xfd {x)\ < ^{fd - Syd, fd)^,o^/^ (0). ^' 



(4.14) 
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se I **' Ise 



Proof. Estimate 4.13 is just the Cauchy- Schwartz inequahty \{fd,Rx)se\ — \ fd\se 
The second inequahty of this theorem follows directly from the inequality 
Wfd - ^xfdWyjfi < ll/dlU,o> proven in part 2 of Theorem 140. Thus 



\U {x) - S'kfd {x)\ < ^iU - S%U, .fd)^^,,jR. (x) - (S'^R.) {x) 

<^J{fd-sydJd)^^WRAx) 
= ^ifd-sydJd)^,oV^)- 

■ 

The last result means that convergence results about arbitrary data functions in X!^ can be proved by 
deriving error results for the Riesz representer function at x. 

I will now use the smoothing value seminorm to put bounds on the smoother that show how the 
smoother behaves for large values of the smoothing parameter p: it decreases to zero at at least the rate 

l/p. 



Theorem 146 If f & and Se is the Exact smoother of f with smoothing coefficient p then 

\Se {x)\ < 



\w,0 

where Ro (0) = (277)"^ G(0) = ||iio||^,o- 
Proof. If Sx is the Exact smoother mapping then 

\s'xfix)\ = \{S'xf,R.)\ < \\sScfL,o WR^Lfi < 

Prom part 1 of Lemma 142 

1 



/IL,o/Mo)> ^ P<Ro{0): 
/IL,o(v/^MO))%"'' p>Ro{0)., 

ll/IL,o/MO)nii4l>^ 



(4.15) 



\w,0 W^xWyjfi 



„,o /MO). 



(4.16) 



{pNI + Rx,xr'Sxf 



< 



and from part 3 of Theorem 52, 



£xf 



< 



, s/N^fR^. Hence 



/)^,o = {£xfY {pNI + Rx,xr' £xf < 



< 



£xf {pNI + Rx,xr^£xf 

l/IL,o {^^/^) -^^{sy,f)^^, 
l/IL,o \/iM0)^^(5i/,/)^,o, 



so that {S%fJ)^Q < 11/11^,0-^0 (0)/p. But by Theorem 145 {S$.f,g)^ Q is a semi-inner product and 
hence by the Cauchy-Schwartz theorem 



(51:/, 5)^,0 1 < ^/(^^fJ)Z^/(^^^9^o^ f,9&Xl, 
which allows us to conclude that 

\S'xf{x)\ = \iSScf,R.L,o\ < ^(<5I:/,/)^,o^(<5|:iix,iix)^,o< ll/IL,o (^MO))'^- 



which when combined with inequality 4.16 proves the first estimates of this theorem. 



Remark 147 These bounds on the smoother make more sense if we note that \ f {x)\ < ||/||^ g yRoW)- 
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The goal of the next theorem is to prove the inequahty of part 3 of this theorem. This inequality is 



proved using the identity of part 2 which is in turn derived by expanding the two terms \\Rx — RxW 
||5|:(J?,-i?,w] 



, Q. This approach is motivated by the identity of part 2 of Theorem 64, namely, 



(x) - (Jxi?x) (x) = \\R, - 11^ - \\Ix (i?x - R:cW] 

side. Here 2x is the intcrpolant mapping. 



^ Q, which was proved by expanding its right 



Theorem 148 Recall that \f\gg = {Sxf — f) (x) is the Exact smoother error seminorm introduced in 
Definition 144- Then for x,y G K''; 



1. 



\\S'x (R. - Ry)t,o = i^xR.) (x) - 2 iS'xR^) {y) + {S'xRy) {y) - 



pN 



{pNI + Rx,x) ^£x{Rx-Ry) 



\Rx ~ R 



\Rx ~ R 



\\Sx {Rx - R 



pN 



{pNI + Rx,x)~^ £x {Rx - Ry) 



3. \Rx\se<\Ry\se + \\R--Ry\Lfi- 
Proof. Part 1. Using the equation of part 1 of Lemma 142 with f = R^ — Ry 

\\Sx (Rx - Ry)\\l,o = {Sx (Rx - Ry) , Rx - Ry)^^o " \ (P^^ + Rx^xV' £x {Rx " Ry) 
= {SxRx,Rx)^fl - 2 {SxRx,Ry)^fl + {SxRy, Ry)wfi " 



pN 



{pNI + Rx.x) ^£x{Rx-Ry) 



= {S'xRx) {x) - 2 {S'xRx) {y) + {S'xRy) {V) - 



- pN {pNI + Rx,x)~^ £x {Rx - Ry) 



Part 2. Since \\Rx - RyWl,^ = Rx {x) - IR^ {y) + Ry {y), by part 1 

11^. - RyWl^o - W^x {Rx - Ry)\\l^o = (^) " i^xRx) {x)) - 2 (i?. {y) - {S'xRx) {y)) ■ 

+ {Ry {y)-iS'xRy)iy)) + 



pN 



{pNI + Rx.x) ^£x{Rx-Ry) 



- \Rx\se + \Ry\se ~ 2 iR<c^Ry)se + 



+ pN 



{pNI + Rx,x)~^ £x {Rx - Ry) 



Rx - RyVse + iP^I + Rx,x) £x {Rx - Ry) 



and rearranging gives the result. 



Part 3. From part 2, \Rx - Ry\^^ < {{R^ - Ry\\^ Q and since {Rx - Ry\^^ > \Rx\se - \Ry\se ^® ^^'^^ 
part 3. ■ 

I now need the following lemma: 

Lemma 149 Suppose the weight function has property W2 for some k > 0. Suppose X = {x^*'^}^^ is 
the independent data for the Exact smoothing operator Sx ■ Then 

R,u) [x^""^) - {S^R^u)) (x^''^) = pNSj,k - {pNf {{pNI + Rx^x)'') , ^ , (4.17) 
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where ( {pNI + Rx,x)~^ ) is the j, k th element of {pNI + Rx,x)~^ ■ Also 

\ ' J j,k 

o<((.iv/ + i?.,.)-)^_^<-l^. 

Proof. Using equation 4.11 for [SxRxU)) (a;^'^^) implies 

(^^'^) - (<5ii?,(.)) (xf'^)) - {SxRxi^))' {pNI + Rx,xr^ £xR. 

= R^w - (SxR^wY {pNI + Rx,xy^ SxR^o 

= R^u) (^e^)) - (ixR^u^y {pNI + Rx,x)~^ [pNik + SxR^m) + 

+ (ixR.ujY (pNI + Rx,xr' {pNik) 
= R^w (x^''^) - (£xRxU)yik+ 

+ (ixR,u)Y {pNI + Rx,xr' (pNik) 
= (£xR,rU))^ {pNI + Rx,xy' (pNik) 

= pN {£xRxU)Y {pNI + Rx,x)~^ h 

= ipN) il {pNI + Rx,xr^ ixR,u) 

= {pN) il {pNI + Rx,x)~^ (pNij + SxR^u)) - 

-{pN)il{pNI + Rx,xr\pm,) 
= pNilij - {pNf il {pNI + Rx,xr' ij 
= pN6j,k - {pNf {{pNI + Rx,xr^) , 

\ / j,k 

where ( {pNI + Rx,x)~^ ) is element j, k of {pNI + Rx,x)~^- 

\ / j,k 

Since pNI + Rx,x is positive definite, {pNI + Rx,x)~ is positive definite and thus 
\l{pNI + Rx,xr^h>(^. 

Remeirk 150 Equations -^.J7 are equivalent to the matrix equation 

Rx,x - {x^'^)) = pNI - {pNf {pNI + Rx,xr^ , 



I.e. 



, R,u) )J = pNI - {pNf {pNI + Rx,xr' . 



The next theorem shows how the error of the smoother of a function Ry is related to the behavior of 
the basis function near the origin, as well as to the number of data points and the smoothing coefficient. 

Theorem 151 Suppose is the Exact smoother mapping with smoothing coefficient p and X = 
{^^^^)l=v ^^en for each x^^^ e X 



Jr, (x) - {S-M (x) < JpN - (pNf UpNI + Rx,xy^) + -^a/g (0) - G (x - xW), 

where {{pNI + Rx,xY^^ is the kth (positive) main diagonal element of the matrix 

{pNI + Rx,xV\ 
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Proof. In part 3 of Theorem 148 it was shown that |i?a;|se < l^ylse ~^ W-^^ ~ -^yWw o '^^^^^ I'Le the 
smoother error seminorm. Setting y = x^''^ G X and using equation 4.17 with j = k this inequahty 
becomes 

\R^\se^\jR- (x) - (x) < (xW) - {S%R,,.>) {x(k)) + -i?,(.,|L^o 

= ^JpN-ipNf ({pNI + Rx,x)-')^^^ + \\R.- IL,o > 

and since (y) = (27r)^''^^ G {y — x) 

ll^x - R^W\\1,0 = {Rx - Ra,W,Rx - -Rx(^))^,0 = R^ - (x^'''^) + Ra^W [x^'''^) 

= ^(g(O)-g(x-x(^0)), (4.18) 

and our result follows. ■ 



4.6 Pointwise error estimates without assuming unisolvent data sets 

In this section we will prove the Exact smoother analogues of the Type 1 and 2 interpolant error estimates 
of Subsection 2.5.2. Recall that for Type 1 estimates no restriction is placed on the weight function 
parameter k but a local smoothness condition is imposed on the basis function near the origin. On the 
other hand Type 2 conditions only assume k > 1 and we use the resultant basis function smoothness 
properties. The interpolation weight function examples will be used, augmented by the central difference 
weight functions. 



4-6.1 Type 1 error estimates {k > 0) 

In the next two corollaries a smoothness condition is applied to the basis function near the origin and 
this will allow an order estimate to be obtained for the pointwise smoother of data functions error on a 
closed, bounded, infinite data region. We start with Riesz data functions: 

Corollary 152 Suppose a weight function has property W2 for some k > and the (real-valued) basis 
function G has the smoothness property assumed by Theorem 59 i.e. assume for some s > and Cg, 
hG>0 that 

G{0)-G{x) <CG\xf' , \x\<hG. (4.19) 

Further, suppose S'^ is the Exact smoother mapping with smoothing coefficient p and independent data 
set X contained in the bounded, closed, infinite data set K . 
Then if hx,K = maxdist {x, X) < hG it follows that 

\Ry (x) - iS'xRy) {x)\ < {-/pN + kG {hx,KY)\ x,y€K, (4.20) 

where 

kG = (27r)-^ ^/2C^. 

Also 

\Ry (x) - {S^xRy) {x)\ < Ro (0) , x,y€ M.'^. (4.21) 

Proof. Let X = {a;^'^)}^^^ and suppose hx,K < hG- Then for each x G K there exists x^''^ € X such 
that G{0)-G{x- a;(*^)) <Cg\x- x'^''^^' < Cg {hx^xf'- Thus Theorem 151 implies that for allxeK 

R, {x) - {S^xRx) {x) < + ^^Jg (0) -G{x- xik)) \ = (y/^+kG {hx,Kry- 
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Suppose y € K. Then setting = Ry in inequality 4.13 I got 

\Ry {x) - {S'M (^)l < ^j{Ry - s\Ry, Rv)^,o^Jr^ (^) - {^xRx) [x) 

= ^Ry (y) - {S'xRy) {y)^R. i^) - {s$,R,) {x) 

< (^y/^ + ka {hx,KTy ■ 
Finally 4.21 follows from 4.14 with fa = Ry. ■ 

When p = the above results give an interpolant error estimates of order 2s. These arc the interpolant 
error estimates obtained in Chapter 2. The next corollary is a simple consequence of inequality 4.13 and 
Corollary 152. This result is an estimate of the smoother error for an arbitrary data function in X^. 

Corollary 153 Suppose: 

1. The basis function G has the properties assumed in Corollary 152 for the constants C'g, he, s > 0. 

2. Suppose Se is the Exact smoother of the arbitrary data function fd € X° on the independent data 
set X contained in the bounded, closed, infinite data set K. 

Then if hx k = maxdist {x,X) < ha we have 



\fd {x) - Se {x)\ < ^{fd - Se, /d)^,o + (/ix,k)') , X€K,fd€X°, (4.22) 



where ka = (27r) * \/2Cg and^{fd - Se,fd)^^o < ||/dlL,o- 
Remark 154 

1. When p = Q the last two corollaries yield the Type 1 interpolant error estimates of Subsection 2.5.2. 
Thus we will label the parameter s the order of convergence. 

2. The factor ^{fd — Sg, fd)^ q ''^ right side of 4-22 enables the error estimate 4-20 for Rx to be 
recovered immediately. This is because 

{Rx — SxRxi Rx)w ~ (-^^ ~ ^xRx) {x) where Sx is the Exact smoother mapping. 

Error suimuary 

Now we combine the smoother bound of Theorem 146 with the error estimates derived in this Subsection 
4.6.1 to obtain the error estimates that will be tested numerically in Section 4.8. 

Theorem 155 Suppose K is a bounded, closed, infinite set and there exist constants Co, s, ho > 
such that 

G (0) - G (x) < C'g \xf\ \x\<hG,xeK. 



Set ka = (27r) * v2Cg. Then the Exact smoother S^fd satisfies the error estimates 

r \\fdL,o{^+kG{hx,Kr), 



\fd{x)-{S%fd){x)\< mini 



/dlL,o VRo (0), (4.23) 
^ ll/dlUK + ll/c/IL,ov/^min{l,«fl}, 



when X £ K and hx,K = maxdist {s,X) < he. Here Rq (0) = (27r) ^ G (0) and X is an independent 
data set contained in K . 
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We also have the corresponding double order convergence estimate 

' {s/W+kG{hx,Ky)\ 

\Ry {x) - (S^Ry) {x)\ < min i Ro (0) , (4.24) 

. ||ii,IL,^ + i?o(0)min{l,«f)}, 

for x,y G K. 

Proof. From Corollary 153 and 4.14 of Theorem 145 we have 

\fd (x) - Se {x)\ < min {ll/^t (V^+ko {hx,Kr) , ||/d|L,o VB^)} • 
From Theorem 146 

\se {x)\ < ll/dlL^o /MO) niin |l, , 

so that 

I/, (x) - 5e (x)| < I/, (x)| + |se (x)| < + ||/,|L,o v^MO) min (l, ^ 



which yields 4.23. Thus when = Ry and x & K 

\Ry ix) - s, ix)\ < \\Ry\\^^^ + \\Ry\\^^, /MO) min |l, 
= P.IUK + ^o(0)min|l,.^° 



P 

which combined with the estimates of Corollary 152 gives the second set of inequalities 4.24. ■ 

Smoother convergence 

I will now introduce the concept of the convergence of a smoother to its data function. Theorem 73 
showed that for a bounded, closed, infinite data region K there exists a nested sequence of independent 
data sets X^*) c X^^+i) c K such that 

hxw if ^ as A; ^ 00. 

An inspection of the error estimates of this subsection clearly shows that the smoothing error can 
tend to zero whilst the smoothing coefficient p remains positive. Consider the estimate 4.22 of Corollary 

153 and suppose is a nested sequence of independent data sets such that hxw k ^ ^- Denote the 
number of points in X^'^-' by A^^;. Then given e > there exists kg such that 

||/dlL,o {f^x(''e),Ky < I when k > k^. Also for N > Nk^ we have hx^^^ < hx(ks),K- Choosing pk 
so that \/ pkNk = 2\\fd\\ — calculation shows that 

|/rf(a;)-.S^(.)/d(x)| < \\fA^^o{^ + kG{hx,Ky) 

< ll/dlL.O VPkNk + ||/d|L,o^G {hx(>'),Ky 

= e. 

We call s the order of convergence of the smoother. 



4-6.2 Type 1 examples 

The interpolant weight function examples are also used here i.e. the radial shifted thin-plate splines, 
Gaussian and Sobolev splines, and the tensor product extended B-splinc weight functions, augmented 
by the tensor product central difference weight functions introduced in Chapter 3. 

When p = the error estimates of this subsection arc the Type 1 interpolation estimates of Subsubsec- 
tion 2.5.2 and consequently the values for s, Cq, ho, kc have already been calculated and are exhibited 
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Exact smoother conver 


gence order estimates 




Type 1 non-unisolvent estimates assuming k > 0: ka — (Stt) \/2Cg- 




Parameter 


Convcrg. 






Weight function 


constraints 


order s 


Gg 


ha 


Sobolev splines 


v-i = i 


1 


77TT 

I|P^0(P)||^(2) 


oo 


2 


2"-ir(t)) 


[v > d/2) 


2 ' 


1 




!, 






Shifted thin-plate 




1 


eq. (4.25) 


OO 


< V < 0) 










Gaussian 




1 




oo 


Extended B-spline 




1 

2 


Gi (0r^||i?Gi|Ux/rfW 


oo 


Central difference 




1 
2 


{2T,)-i^2G^ (0)''-'||£>Gi|U^ W 


oo 


Gi is the univariate basis function used to form the tensor product 


Ky is the modified Bessel function and Ky (r) = r^Ky (r). 



TABLE 4.1. 



in Table 2.1. These values are given below in Table 4.1 augmented by the values obtained for the central 
difference tensor product weight functions from Table 3.1. 
Equations 4.25 are referenced by Table 4.1: 

Cg = |(2r/" + /') (w)| , where f (r) = (l + r^) " and w = (4-25) 



4-6.3 Type 2 error estimates (/?>!) 

The Corollaries 152 and 153 which were used to obtain Type 2 interpolant error estimates are again used 
here by Theorems 156, 157 and 158 to obtain estimates for G{0) — G {x) when k> 1. 

Theorem 156 Suppose a weight function satisfies property W2 for k=1 and denote the basis function 
by G. Then the smoother error estimates of Corollaries 152 and 153 hold where 

CG = -i(v2G)(0)d, s = l, /iG = 00, kG = {2n)-i V-(V2G) (O)v^. (4.26) 

Proof. Theorem 39 showed that if a weight function has k > 1 then 

G(0)-G(a;)<-^(V2G)(0)|a;|^ x^W^. (4.27) 

An application of Corollary 152 now proves this theorem directly. ■ 
However, if the weight function is radial we can use the estimates: 

Theorem 157 Suppose a radial weight function satisfies property W2 for k = 1 and denote the 

(radial) basis function by G. Set r = \x\. Then: 

1. IfG{x)=f{r^) then 

G{0)-G{x)<-f'{0)d^\x\^ , xGR'^. (4.28) 

2. IfG{x) =g{r) then 

G{id)-G{x)<-^g" {{))d'^\x\^ , x€W^, (4.29) 
Further, the smoother error estimates of Corollaries 152 and 153 hold for 

Cg = -f (0) rf' or - ^g" (0) s = 1, /ig = oo, kG = {2n)-i ^J2€^. (4.30) 

Proof. Follows directly from Theorem 39 and Corollary 152. ■ 

If the weight function is a tensor product the following result will be useful: 



4.6 Pointwise error estimates without assuming unisolvent data sets 
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Theorem 158 Suppose a tensor product weight function satisfies property W2 for k = 1 and denote 
the univariate basis function by G\ . Then 

G{0)-G (x) < -^Gi {Of-^ D^Gi (0) , x e 

and the smoother error estimates of Corollaries 152 and 153 hold for 

CG = ~G^{0f-^D^G^{Q), s = l, /iG = oo, kc = (27r)-^ ^-Gi (Qf-^D^G^ {0)Vd. 

Proof. Since Gi is the univariate basis function 

d d d 

(V^G) (0) = J2 (DIG) (0) (^i) (^2) . . . Gi (x,)) (0) = ^ Gi (O)'^"^ D^Gi (0) 

fe=i fe=i fc=i 

= Gi {of-^D^Gi {0)d, 

The estimates of this theorem then follow from Theorem 156, 4.27 and an application of Corollary 152 



4-6.4 Type 2 examples 

When p = the smoother error estimates of Corollaries 152 and 153 become algebraically identical to 
the Type 2 interpolant error estimates of Subsubsection 2.5.2. Further, the weight function examples 
used above were also used for the Type 2 interpolation examples. If we use the 'radial' Theorem 157 
to do the estimates for the radial basis functions and Theorem 156 to do the estimates for the tensor 
product basis functions then the values of the variables s, Cg, he, kG will match those obtained for the 
interpolants. These are given below in Table 4.2 which is a copy of Table 2.2 augmented by the results 
for the central difference tensor product weight functions from Table 3.1. 
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Type 2 non-unisolvent estimates assuming k > 1 


Weight function 


Parameter 
constraints 


Converg. 
order 


{2Tr f^kc/Vd 


Sobolev splines 
(v > d/2) 


v-l>2 


1 


lT(v~d/2-l) 

V 2<i/2+ir(t)) 


l<v-^<2 


1 


/ T{v-d/2-l) 

V 22"-''/2-3r(v) 


Shifted thin-plate 
{-d/2 <v<0) 




1 




Gaussian 




1 


^/2 


Extended B-spline 
(1 < n < 


n > 2 


1 


^-Gi {Qf-^D^G^ (0) 


Central difference 

(1 < n < 


n > 2 


1 


^-Gi {Qf-^D^G^ (0) 


Gi is the univariate basis function used to form the tensor product. 



TABLE 4.2. 



When p = we see that the order of convergence is at least 1 for an arbitrary data function and at 
least 2 for a Riesz representer data function, no matter what value k takes. However, in the next section 
we will show that by assuming the independent data is unisolvent of order k > 1 it follows that an order 
of convergence of at least [kJ can be attained for an arbitrary data function and an order of convergence 
of [2kJ for a Riesz representer data function. 
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4.7 Pointwise error estimates obtained using unisolvent data sets 

In this section we will derive smoother error resiilts using the Lagrange interpolation theory. These results 
require that the independent data points constitute a unisolvent set and assumes that the boundary of 
the data region boundary satisfies the cone condition. 

These results achieve the same order of convergence as the interpolation results of Subsection 2.5.3 in 
the sense that when we set p = we get the same interpolation error estimates. 

Unisolvent sets were introduced in Subsection 2.5.3 where they were used to define the Lagrange 
interpolation operators V and Q = I — V. Lemma 72 supplied the key interpolation theory and I 
reproduce it here: 

Lemma 159 ( Copy of Lemma 72) Suppose first that: 

1. fl is a bounded, open, connected subset o/R'' having the cone property. 

2. X is a unisolvent subset offl of order m. 

Suppose {1]}^^^ is the cardinal basis of Pm with respect to a minimal unisolvent subset ofQ. Again, 
using Lagrange polynomial interpolation techniques, it can be shown there exists a constant K'q „ > 
such that 

M 

and all minimal unisolvent subsets offl. 
Now define the data point density measure 

hx,n = sup dist (w, X) , 
wen 

and fix X G X. By using Lagrange interpolation techniques it can be shown there are constants 
cn,mj^a,m > such that when hx,n < hn^m there exists a minimal unisolvent set Ac X satisfying 

diam(AU {x}) < cn,mhx,n- 

Our main result for arbitrary data functions is: 

Theorem 160 

1. Let w be a weight function with property W2 for some parameter k, > 1 and let G be the basis 
function generated by w. Set m = [k\. 

2. Suppose Se is the Exact smoother of an arbitrary data function fa G X^ evaluated on the indepen- 
dent data set X contained in the data region O. 

3. Suppose the notation and assumptions of Lemma 159 hold for this theorem i.e. X is m-unisolvent 
and ft is a bounded, open, connected set whose boundary satisfies the cone condition. 

Then there exist positive constants kc, CQ^m, K'q ^, ha^m such that the smoothing error satisfies 

\fd {X) -Se{x)\<^ (/d - Se, /d)^,o + (/IX.q)™) , X eH, (4.31) 

when hx,Q = sup dist {u, X) < hn^m- 

Here ka = ,^"!d/2 (co,m)™ max ID^^G (0)1 and the constants CQ^m, Kq ™ and hn^m only depend 

(.^"■J ' |/3|=m ' 

on fl, m and d. In terms of the integrals which define weight property W2 I have 

max |£>2/Jg(0)| < {2t,)-'"^ [ 
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Proof. Fix X G By Lemma 159 there exists a minimal unisolvent set A = {ak}^^=i of order m and 
constants cn,m, hx,n > such that 

diam(A U {x}) < cn^mhx,n when hx,n < /iQ,m- 

Further, there exists a positive constant K'q ^ such that 

M 

where is the cardinal basis of P„i with respect to A. The next step is to partition the error 

function using the Lagrange interpolation projections V and Q = I — V so that 

fd-Se=V iU -Se) + Q {fd - Se) , 

and note that V {fd — Se) is not zero. However, I can still apply Lemma 159 to the second term 
Q [fd — Se) in exactly the the same way as in the proof of Theorem 74 to show that 

\Q{fd - Se) {x)\ < kcWfd - Se\\^^oihx,nr , (4.32) 
when hx,n < hn,m- Part 2 of Lemma 142 then implies ||se||^,o - fd)w,o ^"^^ so 

\Q{fd- Se) {x)\ < ka {fd, fd - Se)^,o , 

when hx,Q. < hn,m- However, it still remains to estimate V {fd — Se) {x). By definition of V 

M 

V {fd - Se) {x) = ^ {fd {ak) - Se {ak)) Ik {x) , 

fe=l 

and by 4.13 



\fd {ak) - Se (afe)l < ^J {fd - Se,fd)^fi\jRau (dfe) " ('^x-RoJ K)- (4.33) 



But by Lemma 159, ^ \lk {x)\ < K'q ^ for a; e f2, so 
fe=i 

M 

\r {fd - S'xfd) {x)\<Y, \fd {ak) - Se (afe)l \lk {x) 



fe=i 

/ M \ 



^ |Zfc (a;) I J max \fd {ak) - Se {ak) \ 

-n,m\J {fd, fd — Se)^fi niax J{Ra,Aak)-{S'xRa,){ak)), 
where is Exact smoother operator. Hence Lemma 149 implies 



\r {fd - S'xfd) {x)\ < K'^,.^{fd-Se,fd)^,o^. (4.34) 

Combining inequalities 4.32 and 4.34 gives inequality 4.31 for all x € O. The extension of the inequality 
to is an easy consequence of the fact that fd and are continuous on R'^. ■ 

Observe that when p = the order of convergence derived above is [kJ and this is the same order 
derived for the interpolant of an arbitrary data function in Theorem 74. I now derive a double order of 



convergence result for the Riesz representor data functions Ry. Note that the factor ^{fd — S'^fd, fd)^ g 
in the right side of the estimate 4.31 for \ fd {x) — {S^fd) {x)\ facilitates the following simple proof: 

Corollary 161 Under the conditions and notation of Theorem 160 the estimate 

Ry {x) - {S3,Ry) {x) < (k'^^^^ + Ug {hx,a)'^y , x,yGQ, 
holds when hx,n < hn^rn and m = [kJ . 
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Proof. Suppose hx,Q < hn,m- When = Rx the inequahty proved in Theorem 160 becomes 
{Rx - S^M {x) < ^J{Rx - S^^Rx) {x) (k'^^^^ + ka {hx,nr) , x eH, 
which imphes that 

and an appUcation of the estimate 4.13 with fa = Ry gives the desired result. ■ 
4-7.1 Examples 

The weight function examples used here are those used in for interpolation i.e. the radial shifted thin- 
plate splines, Gaussian and Sobolev splines, and the tensor product extended B-spline weight functions, 
augmented by the tensor product central difference weight functions from Chapter 3. 

When when the smoothing coefficient p is zero the error estimates of this section arc the unisolvent 
interpolation estimates of Subsection 2.5.3 and consequently values for k and the Lagrangian constants 
kG,cn,m,KQ^,hQ^rn are the same. We can't calculate the Lagrangian constants but wc estimated the 
convergence order for the interpolant as max [k\ which is given in Table 2.3. We use this value here 
and they are given below in Table 4.3 augmented by the order values obtained for the central difference 
tensor product weight functions from Table 3.1. 
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Unisolvent data and k> 1 


Weight function 


Parameter 
constraints 


Convergence 
order (max [kJ ) 


^n,max| K 1 


Sobolev splines 

{v > d/2) 


z;- 1=2,3,4,... 






- 1 


00 












Shifted thin-plate 




2,3,4,... 


00 


Gaussian 




2,3,4,... 


oo 


Extended B-spline 
(1 < n < 


n > 2 


n- 1 


oo 


Central difference 
(1 < n < 


n > 2 


n- 1 


oo 



TABLE 4.3. 



4.8 Numerical experiments with the extended B-splines (non-unisolvent 

case) 

In this section the convergence of the Exact smoother to its data function is studied numerically using 
scaled, extended B-splines, an example of which is the hat function. We will use the same 1-dimensional 
data functions and the same extended B-splines, introduced in Theorem 7, that were used for the nu- 
merical interpolation experiments described in Section 2.6. 

We will only consider the numerical experiments in one dimension so that our standard data density 
parameter hx,K = maxdist(x, X) can be easily calculated. It is also easier to test the non-unisolvent 

error estimates i.e. those derived without assuming unisolvent data sets. This is because in the non- 
unisolvent case the constants are known precisely. In the case of unisolvent data the theory is complex 
and it is unclear what are suitable upper bounds for the constants e.g. K'^ ^, CQ^m, hn,m in Theorem 
160. 

Recall from Theorem 7 that the extended B-splines are a special class of tensor product weight functions 
which satisfy property W2. Indeed, for given integers l,n their univariate weight function is defined by 

<>■ 2n 

"'i(^) = TT-^' x = {xx,...,xa)eW^, (4.35) 

, sm x. 
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where the weight function w has property W2 for k iff n and I satisfy 

K+1/2 <n<l. 



(4.36) 



Theorem 29 showed that the basis function G generated by w is the tensor product G {x) = Gi {xk) 

fe=i 

where 

nl 



Gi it) = (-1)'-" J^T^^C-") ((*A)') (I) , 



(4.37) 



and (*A) denotes the convolution of I 1-dimensional hat functions. Further, Gi G Cq " (M^) and 
jr)2n-ig^ is a piecewise constant function. Finally, G € Cq^""^^ (R'') and the derivatives {-D"G}|o,|^2n-i 
are bounded functions. 

Because all the extended B-spline basis weight functions 4.35 have a power of sin a; in the denominator 
we will need the special classes of data functions developed in Section 2.6. 

The 1-dimensional independent X data set is constructed using a uniform distribution on the interval 
K = [—1.5, 1.5]. Each of 20 data files is exponentially sampled using a multiplier of approximately 1.2 
and a maximum of 5000 points, and then we plot logiQhx,K against logj^Q where N = \X\. It then 
seems quite reasonable to use a least-squares linear fit and in this case we obtain 



hxK^ 3.09N' 



-0.81 



For ease of calculation let 



hx,K = hiN-", hi = 3.09, a = 0.81. 



(4.38) 



(4.39) 



Noting the error estimates of Theorem 155 we use 4.39 to write h^x — define the Exact 

smoother error estimates 



and 



where 



\fd (x) - (5i/d) {x)\ <Er,{p), xe K, 

\Rv - iS'kRv) {x) I < £iv (p) , x,y& K, 
r \\fd\Lo{VpN + kGhlN-^), 



En (p) = min < 



and 



ll/dlL,o/Mo). 

\\U\Uk + \\ML,o \/iMO) min {l, ^} , 

+ kchlN-'^'Y , 
£iv(p)=niin^ -Ro (0) , 



(4.40) 
(4.41) 

(4.42) 



(4.43) 



11^.1 



+ 



i?o(0)min{l,M}. 



4-8.1 Extended B-splines with n — 1 
The case n = 1, Z = 1 

The 1-dimensional hat weight function is given by wa (^) = 

\/27r ^ ) i^-^) ^'^^ examining 4.37 we see that it is a scaled, extended B-spline weight function 
corresponding to the parameters n = I = 1. Also 4.36 implies max [k\ = 0. 

Now suppose n is the 1-dimensional rectangular function given by Il{x) = 1 when |a;| < 1/2 and 
n (a;) = when \x\ > 1/2. Then it was shown in Subsubsection 2.6.1 that: 
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Theorem 162 Data function Suppose wa is the univariate hat weight function. Then u * n G -''^^a 
u ^ (^^) ■ Further, we can define V £ L^^ n by 

V{x) = [ u{t)dt, u e i^, 

so that DV — u a.e. and 

M * n = (27r)"^ ^ ^ ^ ^ ^ 

To obtain our data function = u * 11 we will follow the interpolation approach of Subsubsection 
2.6.1 and choose 

u = e , 

for which 

y=(27r)^erf, \\u\\^ = 2 i27r)-- , ||u * n||^,^ ^ = 2, (4.44) 

and 

/rf^u*n = erf (^x+i^ -erf . (4.45) 

Since we are using the hat basis function, 0(0) = 1, Cg = 1, s = 1/2, he = oo, and from 4.44 and 
4.45, = 2. 

For the double rate convergence experiment we will use Rq ~ {2n) ^ A as the data function. 



Numerical results 

We start by plotting the error bounds given by 4.23 and 4.24, together with the actual absolute smoother 
error, against the smoothing parameter. The results for the data function fd are shown in Figure 4.1 and 
the results for the data function Rq are shown in Figure 4.2: 

Exact smoother error vs Smoothing parameter for various Nos of data points 

10' : 




r'\ i I 1 I i I 1 I 1 i I 1 I 1 i 

10"' 10"' 10"' 10"° 10"' 10"' 10"' 10"' 10° io' io' 



Smoothing parameter 

FIGURE 4.1. Exact smoother errors vs smooth parm: data func is erf (2; + |) — erf [x ~ 
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Exact smoother error vs Smoothing Parameter for various Nos of data points. 

10 f- - I- ■-■-■-■-■i- - - - - -r - - - - -i ----- - -i 




10 1 1 1 1 1 

-8 -6 -4-2 2 

10 10 10 10 10 10 

Smoothing parameter 



FIGURE 4.2. Exact smoother errors vs smooth param: hat basis tunc, data tunc is Rq. 



The Figure 4.1 seems typical for C°° data functions and it is clear that except for smoothing parameters 
larger than 10 the predicted error bounds given in 4.42 and 4.43 capture only a small part of the actual 
convergence rate. Indeed, the error ratio for the function Rq is 10^'^ and that for fd ranges from 10^'^ to 
2 X 10^. 

We now move on to examine the relationship between the smoother error and the data density. To do 
this we need to fix a value for the smoothing parameter and we choose p = 10~^. We select this value 
because often the Exact smoother error curve has a minimum near this value. Using the functions and 
parameters discussed at the end of the last subsection we obtain the four subplots displayed in Figure 
4.3, each display being the superposition of 20 smoothers. 

The two upper subplots relate to the data function 4.45 and the lower subplot relates to the Riesz 
representer data function Rq = (27r)~'^^^ G {x — ■) (see 1.37). The right-hand subplots are filtered versions 
of the actual error. The data function is given at the top of the left-hand plots and the annotation at 
the bottom of the figure supplies the following additional information: 

Input parameters 

N = L = 1 - the hat function is a member of the family of scaled, extended B-splines with the indicated 
parameter values. 

spl scale 1/2 - the actual scaling of the spline basis function is 1/2 divided by spl scale, 
sm parm le-6 - the smoothing parameter is 10^^. 

samp 20 - the sample size. This is the number of test data files generated. The data function is 
evaluated on the interval [—1.5, 1.5] at points selected using a uniform (statistical) distribution. 

pts 2:5K - the smallest number of data points is 2 and the largest number of data points is 5000. The 
other values are given in exponential steps with a multiplier of approximately 1.3. 

Output parameters / messages 

No ill-condit - this indicates all Exact smoother matrices were always properly conditioned. 
Note that all the plots shown below have the same format and annotations. 
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Data fn: erf(x+1/2) - erf(x-1/2) 



Filtered error 







E -4 



O 




-4-2 2 
Log10 data density 

Data fn: RJix) 




-4-2 2 
Log10 data density 



-4 



-6 




-4-2 2 
Log10 data density 

Filtered error 




-4-2 2 
Log10 data density 



dim 1, N=1, L=1, spl scale 1/2, sm parm 1e-6, samp 20, 
pts 2:5K, no ill-condit 



FIGURE 4.3. Convergence of Exact smoother. 



As mentioned above the smoother is filtered. The filter calculates the value below which 80% of the 
errors lie. The filter is designed to remove 'large', isolated spikes which dominate the actual smoother 
errors. The smoother error was calculated on a grid with 300 cells applied to the domain of the data 
function. No filter is used for the first five smoothers because there is no instability for small numbers of 
data points. I conclude that in this case the 'spike' filter can only meaningfully applied to the smoother 
of Rq. This filter will be used in all the cases below. 

As the number of points increases the numerical smoother of Rq is observed to simplify ^^jij^qj^^^^j^ 
large increasingly narrow spikes at and ±1 and these dominate by about four orders of magnitude 
a stable residual error function of uniform amplitude and zero trend. The smoother of fd consists of 
intermingled spikes of various heights superimposed on a trend curve of amplitude comparable to the 
average residual spike size. The maximum spike height is at most one order of magnitude of the average 
spike height so the smoother is stable. 

The (blue) points above each smoother in Figure 4.3 represent the theoretical upper bound for the 
error given by the estimates 4.23 or 4.24, and the adjacent (red) line has a slope which gives the actual 
rate of convergence, ignoring instability. Clearly for the data function fd the theoretical error bound 
substantially underestimates the actual error. Regarding the data function Rq, the estimated 'double' 
convergence rate is 2s = 1 and the estimated (unfiltered) rate is also 1 so for p = 10~^ the theoretical 
upper error bound for the data function Rq is able to take into account quite closely the instability of 
the smoother. However, there is a large difference when the filtered error is considered. 



The case n = 1, I — 2 

Amongst other things the following result, extracted from Theorem 77, will allow us to generate data 
functions for 1-dimensional extended B-spline spaces for which the norm can be calculated This 
result is closely related to the calculations done above for the hat function. 
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Theorem 163 (Portion of Theorem 77) Suppose w is the extended B-spline weight function with pa- 
rameters n and I given by 1.20, and that U e (M^), r>"?7 e m the sense of distributions. Then if 
we define the distribution 

fd = 6{U, 1 = 1,2,3,..., 
where 62 is the central difference operator 

S^U = U {■ + !)- U {■-!), (4.46) 

it follows that fd G and 

ll/rflL,o = 2'll^"C^ll2- (4-47) 
Our basis function is the extended B-sphne Gi with parameters n = 1 and I = 2 given by 4.37 i.e. 

Gi it) = (-1)'-" (Z^^e-) ((*A)')) (I) = -f {D' (A * A)) 

But 

1)2 (A * A) = A * D^A = A * ((5 (• + 1) - 2(5 + 5 (• - 1)) 

^ (A(. + 1)-2A + A(.-1))^ 



;|), temK 



so that 



Gi(i) = -^(A(| + 1)-2A(|)+A(|-1)), 



and hence 

DG, it) = (A' (I + 1) - 2A' (I) + A' (1 - 1)) , 

i.e. ll-DGill;^ = ^v^27r. Since the (distributional) derivative is bounded the distributional Taylor series 
expansion of Lemma 42 can be used to write 

Gi(0)-Gi(t)< ||Z)Gi||^|t|, x&R\ 

which means that 

Gi (0) = ^,Cg = \\DGi\\^ = ^V2^, s = 1, /iG = 00. 
With reference to the last theorem we will choose the bell-shaped data function 

fd = 5lU e Xl, (4.48) 

where 

^^^^^ ^|(e-fci,=-^)(0) " 2(l-e-4fci..)' ^1.2=0.3, 

and 



1 _ e-4'=i>2 ■ 



The error estimates are given by 4.23 or 4.24 where kc = (27r)~^ \/2Cb and i?o (0) = (27r)"^ (-q). 

For the theory developed in the previous sections it was convenient to use the simple, unsealed weight 
function definition 4.35 but for computations it may be easier to use the unsealed version of the extended 
B-spline basis function given in the next theorem. 

Theorem 164 Suppose G = (-l)'"" £>2(;-n) (^{^Kf^ 

where A is the 1-dimensional hat function and 

n, I are integers such that 1 <n < I. 

Then for given X > 0, G {Xx) is called a scaled extended B-spline basis function. The corresponding 
weight function is wx (t) = 2Xaw (^) where w is the extended B-spline weight function with parameters 

n, I and a = 22(i'-n)+i ■ Indeed, for wx we can choose k = n — 1. 
Further 

G(0)-G(Aa;) < ApGll^la;!, xeR\ (4.49) 
Finally, if fa G X° and ga (x) = fa (2Ax), it follows that ga G X°^ and WgdW^^^ = \fa ||/d|L,o- 
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Numerical results 

As with the previous case we select p = 10^^ and, using the spike filter of the previous case and the 
functions and parameters discussed in the last subsection, we obtain the four subplots displayed in Figure 
4.4, each display being the superposition of 20 smoothers. 



Data f n : e^Cae"'"'') Filtered error 




-4 -2 2 -4 -2 2 

Log10 data density LoglO data density 

Data fn: R|j(x) Filtered error 




-4 -2 2 -4 -2 2 

Log1 data density Log1 data density 

dim 1 , N=1 , L=2, spl scale 1 , sm parm 1 e-6, samp 20, 
pts 2:5K, no ill-condit. 



FIGURE 4.4. Convergence of Exact smoother. 



These results are significantly different to those obtained for the previous case n = ^ = 1 . As the number 

of points increases the numerical smoother of Rq is observed to quickly simplify to a single very large 

increasingly narrow spike at the origin, and this dominates a stable error function of uniform amplitude 

and zero trend. The smoother of fd typically forms two major spikes at the boundary x = ±3/2 by 

100 data points and then two minor spikes aX x = ±1/2 at about 1000 data points. These spikes are 

superimposed on a trend curve of amplitude comparable to the average spike size of a small stable error 

function. . 

Student Version of MATLAB 

The (blue) points above the actual errors in Figure 4.4 are the estimated upper bounds for the error 
given by the estimates 4.23 or 4.24, and the adjacent (red) line gives the average convergence rate. Clearly 
for the data function fd the theoretical error bound substantially underestimates the convergence rate. 
Regarding the data function i?o, for p — 10^® the theoretical upper error bound for the data function Rq 
is able to take into account quite closely the instability of the smoother. On the other hand, the filtered 
curves for the data function i?o are obtained from the corresponding unfiltered curves by inserting a 
large step when the data density is about 0.5 and so the theoretical error estimates are at least 4 orders 
of magnitude greater than the actual errors. 

4-8.2 Extended B-splines with n = 2 

Since n > 2, we can use the error estimates of Theorem 158 as well as the same scaled B-spline and data 
function that we used for the interpolant in Subsection 4.8.2. 
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The case n — 2, I ~ 2 
The basis function is 

(A.A)(0) =^(A*A)(2x), 

with scahng factor A = 4 and is such that suppG2,2 = [—1, 1] and G2,2 (0) = 1. To calculate G2,2 we use 
the formula 

G2,2 {x) = (1 + xf A {2x + 1) + (1 - 2a;2) A (2a;) + (1 - xf A {2x - 1) , 
and choose the bell-shaped data function 

fd = 5lU e Xl, (4.50) 



where 



so that 



If, ^, ^3/4 

= 7^7174; ||/.IL,o = ^ /_%,. , fci,2=0.3; /zG = oo. (4.52) 

_ 1 _ 1 

For double rate convergence experiments Rq ~ (27r) ^ ^2,2 is the data function and _Ro (0) = (27r) ^ . 

The next step is to calculate H/dHooX ^^'i II-^oIIooa: where K — [—1.5,1.5]. Clearly ||-Ro|loo,if = 
(27r)-^ G2,2 (0) = (27r)"^ and from 4.50 and 4.51, ||/d||oo,K = fd (0) = 1: 

PolL,K = ^o(0) = (27r)-^ ||/d|L,K = l- (4.53) 

Numerical results 

Using these functions and parameters the four subplots of Figure 4.5 each display the superposition of 
20 intcrpolants or 20 filtered intcrpolants as well the curves which show the upper bounds for the error 
given by the estimates 4.23 and 4.24. 

As the number of points increases the numerical smoother of is observed to simplify to a very 
stable, smooth, symmetrical pattern with a single spike at the origin and two smaller spikes at ±1/2, 
each about 1/3 the central spike's height. At ±land ±3/2 there arc small rounded 'spikes'. The smoother 
between the spikes is very smooth and the trend is zero. 

The smoother of fd forms two spikes at the boundary x = ±3/2. These spikes are superimposed on 
a trend curve of amplitude which dominates the amplitude of a stable residual error function. However 
the trend curve has a larger amplitude than the boundary spikes. 

The (blue) points above each smoother in Figure 4.5 are the estimated upper bounds for the error 
given by 4.23 or 4.24. Clearly, in both the filtered and unfiltered cases, the theoretical bound substantially 
underestimates the convergence rate. 



4.9 Approximation of the Exact smoother 

We shall finish this chapter by approximating the Exact smoother by an easily calculated member of 
Wg,x which does not involve calculating the inverse of a matrix. This result will be used to motivate 

the derivation of the Approximate smoother in Subsection 5.3.1 of the next chapter. 

The relevant properties of the vector- valued evaluation operator £x and its adjoint £x w.r.t. were 

proven in Theorem 52. Now suppose is an ordered set of points in E'^ and / e X° . Define 

g^xlhy 

so that 

has the following properties: 
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Data fn: 8^(ae"'"'') 
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Log10 data density 
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dim 1 , N=2, L=2, spl scale 0.25, sm parm 1 e-6, samp 20, 
pts 3:5K, no ill-condit 

FIGURE 4.5. Convergence of the Exact smoother. 



Lemma 165 Suppose f,g satisfy equation 4-54- Then: 
1- ll/-.9lL,o < ll/L,o- 



2. \f{x)-g{x)\<./lhW)\ 



to,0- 



£xf -£xg 



< 



y/NRo{0) 



£xf 



<ll/l 



Student Version of MATLAB 



4- The Exact smoother operator satisfies: \S^f (x) — S^g (x)\ < y/ Rq (0) | 
Proof. By definition of the Riesz representer 



l^-ll„,.o = Po||,„.o = 



so part 3 of Theorem 52 imphes 



-x 



= \\Rx,x\\ < ViVVi?o (0). 



Thus: 
Part 1. 



ll/-.9L,o 



NRq (0) 



£*x£xf 



■w,a 



<ll/L,o- 



(4.56) 



(4.57) 



Part 2. 



\f{x)-g{x)\ = 



NRo (0) 



\£x£xf, Rx 

V / ii,,0 



1 



NRg (0) 



£x£xf 



\R: 



w,0 



<y/RjQ)\\f\L^. 
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Part 3. By part 5 of Theorem 52, £x£-*x = Rx,x so 



£xf - £x9 



NRo (0) 



Rx,x£xf 



< 



NRo (0) 



£xf 



V^Ro (0) 



£xf 



< 



\\w,0 • 



Part 4. Prom part 2 of Theorem 140, the Exact smoother operator satisfies, ||«S^|| < 1. Hence 



\S'^f{x)-S^xg{x) 



1 



NRo (0) 



[^X^X^xfyRx^ 



< 



1 



< 



NRo (0) 
1 



Sx£*x£xf „ll-RolLo 



N^R^ 

< /MO) 11/ 



£*x^xf 



wfi 



Wwfi ■ 



We now prove our approximation theorem: 
Theorem 166 Suppose f,gG satisfy equation 4-54- Then 



S\f{x)- )^^^}, = S^x9ix), XGR", 



and 



Sic fix) 



(i?o (0) +p)N Ro (0) + p 
(£xRx^ £xf 



{Ro{0) + p)N 



^ 2Ro {Of^ . 

< — mm 

- Ro{0) + p 



1 :M0) 
J- 1 



Wwfi ' 



(4.58) 



(4.59) 



and 



S3cf 



£xRx I £xf 



iRo{0) + p)N 



- i?o(0)+p"^""'°' 



(4.60) 



wfi 



Proof. Starting with 4.11 and using the fact that £x£x = Rx,x (part 5 Theorem 52) we have 

SI, fix) = {ixRccY [pNI + Rx,x)~^ £xf 

= (ixR^Y {pNI + Rx,x)~^ + 

= jv]^ {SxR^Y [pNI + Rx,x)~' Rx,x£xf + (ixR^Y {pNI + Rx,x)~' £x9 
= j^^^ {£xR.Y ipNI + Rx,x)-' Rx,x£xf + S\g {x) 

^ ^ixRxY ipNI + Rx,x)'' [pNI + Rx,x)£xf- 



NRo (0) 



NRo (0) 
(^£xRx^ £xf 



^ (£xR.Y iP^I + Rx,x)-' {pN£xf) + S\g {x) 



NRo (0) Ro (0) 



Slcf{x)+SI,g{x). 



(4.61) 
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and solving for S^f {x) gives 

Sic fix) 



1 (SxRx) Sxf S%-g{x 



+ ■ 



I.e. 



{Ro (0) + p)N Ro (0) + p 
(SxRx) ^xf _ (0) 



1 J e_ 

^ ^ Ro(0) 



s'xgi^), 



{Ro (0) +p)N Ro (0) + p 
which is equation 4.58. To prove the estimate 4.59 we will estimate \Sxg {x)\ using the inequality 4.15: 

Ro (0) ■ 



\SI,u{x)\ < ^Ro (0) min <^ 1, 



\w,0 ' 



u e x°. 



(4.62) 



Applying the Exact smoother to equation 4.55 gives 

Slc9{x)=SI,f{x) 



1 



-S^x£*x£xfix)., 



NRo (0) 

and then applying the estimate 4.62 to both terms on the right side yields 

1 



\S%g{x)\<\SI,fix)\ + 



NRo (0) 



S 



x 



i^£*x£xf) (x) 



<v/^min|l,:^|||/|Lo + 
= VWniin|l,^ 



<2V^min|l,:^ 
I P 

where the last step used 4.57. Finally 



^ll^o||^,o 
"^'0 ^ NRo (0) 

\w,0 ' 



min < 1, 



i?o(0) 



£l£xf 



w,0 



S*xSxf 



w,0 



Sic fix) 



{ExRx^ £xf 



{Ro{0) + p)N 



< 



flo(O) 
Ro (0) + p 

mojfif^ 
-Ro (0) + p 



\Slc9ix) 



min < 1 



i?o(0) 



Ww.O ' 



which proves 4.59. 
Remark 167 

1. If we write 



(SxRx^ £xf 



{RoiO) + p)N (i?o(0)+p)7V^ 



AT 



-d/2 



(i?o iO)+p)N^^ 



1 ^ 



(G(0) + (27r)''/^p)^^ 
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it is clear that the mapping f ^{R(^Q)+p)iJ continuous linear operator from to Wq.x 

with the form of a weighted average, the weights being data-translated basis functions. Note also 
that the operator only depends on the values of the data function in the data region. 

2. The approximation 4-59 provides a bound of order for large p. For all p this estimate supplies 
an upper bound o/2-y/i?o (0) ||/||^ o- 

3. The normwise approximation 4-60 provides a bound of order as p —> oo. For all p this estimate 
supplies an upper bound o/ 2 ||/||^ q. 
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5 

The Approximate smoother 



5.1 Introduction 

We call this smoother the Approximate smoother because it approximates the Exact smoother. This is a 
non-parametric, scalable smoother. Here scalable means the numeric effort to calculate the Approximate 
smoother depends linearly on the number of data points. We assume the basis function is real-valued. 

Two different approaches will be taken to defining the Approximate smoother, and both involve for- 
mulating the smoother as the solution of a variational problem. One of these problems will involve 
minimizing the Exact smoother functional 6 over Wg,x' where X' — {x'^}^^^ is an arbitrary set of dis- 
tinct points in M''. The other, equivalent problem, involves finding the function in Wg,x' which is nearest 
to Exact smoother Sg w.r.t. the norm ||-||^ g. If 

N' N' 

(x) = J2 (^) = (2^)"'^' E <^ - ' 

i=l i=l 

denotes the Approximate smoother and y is the dependent data then solving the second problem yields 

Sa =Tx'Se, 

which implies the matrix equation 

{NpRx',x' + Rl,x'Rx,x') a' = R^^x'V, 

where Rx,x' = (jix'. (s^'*''))- The size of the Approximate smoother matrix is N' x N' which is 
independent of the number of data points and suggests scalability. 

The error estimates for the pointwisc convergence of the Approximate smoother to its data function 
/ e X° are based on the simple triangle inequality 

1/ {x) - Sa {x)\ < 1/ {x) - Se {x)\ + \Se {x) - Sa {x)\ , 

and so Section 5.6 will be devoted to estimating jsg (.x) — Sa i-^)]- 

As with the minimal interpolant and the Exact smoother, we will obtain estimates that assume uni- 
solvent data sets as well as the Type 1 and Type 2 estimates that do not involve unisolvency. The 

Approximate smoother convergence orders and the constants are the same as those for the interpolation 
case which are given in the interpolation tables 1, 2, 3 and 4. 



Non-unisolvent data: Type 1 error estimates No a priori assumption is made concerning the weight 
function parameter k but it will be assumed that the basis function satisfies an inequality of the form 1. 
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For example, if it is assumed that the data region K is closed, bounded and infinite then Theorem 189 
establishes that 

\Se (x) - Sa ix)\ < ||/||^_o {hx' ^kY , X £ K, 

and Theorem 206 shows that 

1/ (x) - Sa {x)\ < 11/11^ + ko {hx,Kr + ka {hx',Kr) , x G K, 

when hx,K = maxdist {x,X) < he and hx',K = maxdist {x,X') < ho- 



Non-unisolvent data: Type 2 error estimates If it only assumed that k>1 then by Theorem 196 

|Se (x) - Sa {x)\ < ka ihx',KT , X G , 



and by Theorem 208 



1/ {x) - Sa {x)\ < 11/11^ + kGhx,K + kahx- ,k) , x 

where ka = (27r)^^ ^_ (ySG) (0)Vd. 



Unisolvent data error estimates If X is a unisolvent set of order m > 1 contained in an open, bounded 



data region fl then by Theorem 200 

|Se (x) - Sa {x)\ < ||/||^^o (/lJf',n)" , X G fl, 

and by Theorem 211 

\f{x)- Sa {x)\ < ||/L,o {K'n,^^/pN + ko {hx,Kr + ko {hx',Kr) , a; e O, 

for some constants K'^ ^, ka > 0. We say the orders of convergence are at least m. 

These theoretical error results will be illustrated using the weight function examples from the interpo- 
lation chapter, namely the radial shifted thin-plate splines, Gaussian and Sobolev splines and the tensor 
product extended B-splines. We will also use the central difference weight functions from Chapter 3. 

Numerical results are only presented for the non-unisolvent data cases. Numeric experi- 
ments arc carried out using the same 1-dimcnsional B-splincs and data functions that were used for the 
interpolants. We restrict ourselves to one dimension so that the data density parameters hx,n and hx,K 
can be easily calculated. 

The last section discusses the Sm,oothOperator software (freeware) package which implements the 
Approximate smoother algorithm. It has a full user manual which describe several tutorials and data 
experiments. 



5.2 The convolution space Jq 

Following Dyn [3] we introduce the space Jq and prove some of its properties, including the fact that 
Jg is dense in . This space will be used in the discretization process which derives the Approximate 
smoother problem from the Exact smoother problem in Subsection 5.3.1. 

Definition 168 The space Jg 

Suppose the weight function w satisfies property W2 for some k > and let G he the zero order basis 
function generated by w. Then 

JG = {G*cP:<f>eS}, 

where the convolution G * (p is defined by G * tj) = 'with G G S" and tj) G S. We will sometimes 

write Jg = G * S . 

Theorem 169 The function space Jg has the following properties: 
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1. Jacx^nc^. 

2.1ff€ X° then (/,G* </.)^_o = [/,^ for all G 5. 
3. Jg is dense in X° . 
Proof. Suppose g £ Jg, say g = G * <f) where ^ G S. 

Part l.g = (f)G=^GL^ since property W2 implies ^ € L^. Thus 



< 



< oo, 



w\g\ = J w- 
and so 5 e X° . For any multi- index a > 



and so by Lemma 13, (G *,?!)) e G^°\ Thus G * e Gg'. 
Part 2. / e impUes /e S" n L^^^^ and 



(/ 



We now need some results from part (a) of Subsection 2.8.3 of Vladimirov [12]. Here he defines locally 
integrable functions which have polynomial growth at infinity i.e. a function g G Lj^^ such that / ^.^l^^i-^s 
for some s > 0. Vladimirov states that these functions have the property that g & S' and [g, cj)] = J gcj) 
for e 5. 

Now, from the proof of Theorem 24, f G and so / has polynomial growth at infinity which implies 



Part 3. A standard result is that a subspacc of a Hilbcrt space is dense iff its orthogonal complement 
is {0}. In fact, if (/, G * (p)^ q = for all </> e 5 then part 2 implies / = 0. ■ 



5.3 The Approximate smoother 

Prom part 5 of Summary 174 the matrix equation for the Exact smoother is {Npl + Rx,x) ct = y. The 
construction and solution of this system does not generate a scalable matrix algorithm since the size of 
the smoothing matrix is N x N i.e. its size depends on the number of data points whereas it will turn out 
that the matrix size for a scalable algorithm is independent of the number of data points. In this section 
we will overcome this limitation and derive the Approximate smoother problem by discretizing the Exact 
smoothing problem on a grid. This will be done by using the space Jg = G * S which is dense in and 
was introduced in Definition 168. The space Jg is not necessary for the specification of the Approximate 
smoother problem but this is how I derived the Approximate smoother algorithm: I came across the 
convolution space Jg in Dyn's review paper [3] and decided to approximate its functions using a regular 
rectangular grid X' and the Trapezoidal rule. This led to functions in the finite dimensional basis function 
space Wg,X'- Mininnzing the Exact smoother functional J,. [•] given by 4.1 over the functions in Wg,x' 
yields the Approximate smoother matrix equation 5.13. The size of this matrix is independent of the 
number of data points and hence the construction and solution of the Approximate smoother matrix 
equation is a scalable algorithm. In fact, the size of the matrix is equal to the number of grid points and 
so increases exponentially with the number of dimensions. In practice we are limited to two or three 
dimensions. 

In actual fact, the set X' will be generalized from a grid to any set of distinct points. This could be a 
sparse grid, for example. 
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5.3.1 Approach 1 an analog of the Exact smoothing problem 

In this subsection we will provide some justification for approximating the infinite dimensional Hilbert 
space by a finite dimensional subspace Wg,x', where X' is a regular, rectangular grid of points in 
M.'^. The space Wg^x' will be used to define the Approximate smoothing problem. The set X' will then 
be generalized to include any set of distinct points. This discretisation process turns out to be similar to 
that described in Garcke and Griebel [4]. 

Definition 170 A regular, rectangular grid in R'^ 

Let the grid occupy a rectangle R{a;b), which has left-most point a € M."^ and right-most point b. 
Suppose the grid has M' = {N[, N2, ■ . ■ , N'^) points in each dimension and let h G M.'^ denote the grid 
sizes. 

Then X' = {x'^ = a -\- ha \ a € 'Z'^ and < a < A/"'} is the set of grid points. 

Let N be the number of grid points so that N = {Af')^ = M[M2 ■ ■ -A/"^, and of course we have the 

constraint Af h = b — a. 

By Theorem 169 the space Jg = G * S is dense in X'^ and so we will approximate the functions in 
S using the rectangular grid X' defined above. Our analysis will be matrix-based so we choose an order 

for the grid points and set X = {x'^}^^-^. Integrals on M** will be approximated by integrals on the grid 
region using the trapezoidal rule i.e. 



/ 

grid 



N 

f{x)dx^h^J2f('^n), (5-1) 
n=l 



where /i-"- = /ii/i2 x . . . x /i^ is the volume of a grid element. This will be a two stage approximation: 
first a restriction to the grid rectangle and then an application of the trapezoidal rule. 

Stepl Approximation of the functions in Jg = G * 5 by functions in Wg,x' ■ 

Suppose (j) G S. Then the trapezoidal approximation 5.1 on the grid X' gives 

N' 

G*<j,= I G{--y)(^{y)dyc^^G{--x'J{h'ct>ix'J). (5.2) 

The equations and approximations 5.1 and 5.2 suggest we approximate the space G * 5 by functions 
of the form J2 G {x — x'^) an, a„ e C i.e. by functions in Wg,x'- 

n=l 

Step 2 With this motivation we could now specify a smoothing problem which wc will call an Ap- 
proximate smoothing problem. This would involve minimizing the Exact smoothing functional 4.1 over 
Wg,x' where X' is a rectangular grid. However, since the space Wg,x' is defined when X' is any set of 
distinct points, we will define the following more general problem: 



The Approximate smoothing problem 



Minimize the Exact smoothing functional Jg [/] for / € Wg,x' 
where X' is a set of distinct points in W'-. 



(5.3) 



5.3.2 Approach 2 

In Theorem 166 it was shown that for any / e X° 



S'xfio:) 



T _ 

ExRx ) £xf 



{Ro{0)+p)N 



id 



and it was noted in Remark 167 that (j^Q((y)^_p)jv ^ Wg.x and this approximation is bounded uniformly 
1 W^. Of course Rx {y) = G {x — y). Now suppose our independent data points X are always 
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contained in a bounded open data region n. Overlay ^l with a regular, rectangular grid and let X' = 

{^'n}n=i ^ denote the grid points in CI. Further assume that G and / are Lipschitz continuous on K"^. 

Since / is Lipschitz continuous on R'^ all the derivatives exist and are bounded a.e. Noting part 2 of 
Remark 78 we have X° c H^'°°, G e H'^'°° and for each x, € H^''^. Indeed 



(5.4) 



Next we want to express (^£xRx^ ^xf in terms of (Sx'Rx^ ^x'f and a remainder term and then 
estimate the remainder term. But 



m=l 



1 T 1 ^ , Af' 



where 



= {x'™) G X : x^"*) - 4 e i? [0; V2)} 



means that {X'f.}^^-^ partitions X. 
The properties 5.4 of Rxf mean that the distribution Taylor series expansion of Lemma 42 can now 
be brought into play to yield 



R. 



(a^f™)) / = R, (4) / (4) + 7^l (i?,/) (4, - 4) , 



and for a;'™) e Xf. 



(7^l (i?./)) - 4) I < V^max \\D^ {R^f) 



x^""^ - 4 



< ^niax||l)''(it!^/)|| \h\ 

< — max llD^ii^ll max\\D^f\ 

- 2 |/3|<lll '^ll°°|/3|<lll 'I 



(5.6) 



Thus 



E = E ^x(4)/(4)+ E (^./) (4, ^'"^-4) 

= 7v^i?,(4)/(4)+ E ^i(i?x/)(4,a;('")-4), 

and jj^ (^£xRx^ £xf can now be written 

1 ~ T _ ^' N' ^ ^' 



k=ix(^'iexi 



with 



N' 



N' 

Y.RAx'u)^f{x'u)^WG,x'. 



fe=l 



N 



Consequently the remainder estimate 5.6 implies 

^(Mx) fx/- E^- (4)^/(4) <^E E |7^i(i?./)(4,^^'"^-4) 



fe=i 



fc=i x("i)ext 

N' 



y — max||£)'5i?^|| max||£)^/|| \h\ 
^ 2 |/3|<i" "°°|/3|<i" "°° ' ' 



A;=la;(m)gX; 



— maxllD'^iJ^II max||£)'^/|| \h\, 



2 |/3|<1 



I/3|<1' 



(5.7) 
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giving a linear dependence on the grid size. 

The last estimate can be combined with 4.58 to give 



N' 



Ro (0) + p 



3/2 



< 



2flo (0) 
Ro{0) + p 



P 



\w,0 ■ 



+ ^max||D'5i?, 

2 |/3|<i" 



I max\\D<^f\ 



\h\, X € 



(5.8) 



so that as N oo and \h\ this approximation is bounded pointwise on E''. 
Now if a weight function has k > 1 then X° c C^^ , G G C^^ and so D^f and D^G are always Lipschitz 
continuous when |/3| < 1. This certainly applies to the shifted thin-plate splines and the Gaussian. 

We will now try to improve this approximation by replacing the vector (a^fe)^ (^o (0) + p)~^ by 

an arbitrary complex vector a' E and then finding a uniform bound for 

when X gR''' and / e X^. In fact we use the standard approach of writing 



< 



Sy{x)-(£*x,a') {x) 
ll^oll 



w,0 



and then show there is a unique a' which 



w,0 



. This is just the adjoint 



formulation of the minimum seminorm interpolation problem and the calculations are simplified since G 
is real. 

Theorem 171 Ifg G then R^-^Ex'g = argmin g — £^,a' is unique and R^^-^Sx' g = Ix'g- 

a'eC' ■"'.0 

Proof. If a' e C^' then 







g-£*x,a' 


w,0 \ 



'X' 



'X' 



= llffll^^o - 2 Re {g,l*x.cl\ + 

= ll5ll^,o - 2 Re [Zx'g, «') + (a, £x'£*x'ol^ 
= llsll^^o - 2 Re {Ex'9, a') + (a', Rx,xol) , 

and if we substitute a' = (3' + i-y' and differentiate w.r.t. (3' and then 7' we obtain a unique minimum 
at a' = Rx]x^x'g- ■ 

This theorem implies that the closest point in Wg,x' to «S^/ under || • ||^ q ^x'Sxf, and so we define 
the Approximate smoother operator <S^ by 



Sx = Ix'S' 



X'Ox 



(5.9) 



It was observed that the Approximate smoother problem 5.3 still makes sense if X' is any set of distinct 

points in R'' and clearly Definition 5.9 is still meaningful if X' is any set of distinct points. In part 2 of 
Corollary 176 we will show that the solution of the Approximate smoother problem is the Approximate 
smoother 5.9. 



5.4 Preparation: the Exact smoother mapping and data functions 

This section presents a convenient summary of the Exact smoother mapping properties required for our 
study of the Approximate smoother problem 5.3. 
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The Exact smoother problem was described in the Introduction and was solved in Chapter 4 using a 
geometric Hilbert space framework which involved the introduction of the Hilbert product space V and 
the operator Cx '■ — > V: 

Definition 172 The space V and the operator Cx 

1. LetV = Xl®'C^ he the Hilbert product space with norm and inner product {■,-)y given by 

{{UI,U2) ,{vi,V2))v = P {ui,Vi)^ Q + iu2,V2)cN ■ 

2. Let the operator Cx '■ X':^ ^ V be defined by Cxf = (^f,£xf^ where £x is the vector-valued 
evaluation function £xf = (/ (a;^^))) of Definition 51. 

The Exact smoothing functional Je can now be written 

Uf] = \\Cxf-{Q,y)ty, fexl (5.10) 

and the Exact smoother Sg is the unique orthogonal projection of (0, y) onto the infinite dimensional 
subspace Cx (X^) . Using this approach it was shown in part 4 of Theorem 135 that 

se = ^ {c*xCxr' £*xy, yec^. (5.11) 

Definition 173 Data functions and the Exact smoother mapping Sx ■ X'^ Wg,x- 

Given an independent data set X each member of X'^ is assumed to act as a legitimate data function 
f and generate a dependent data vector £xf ■ Further, equation 5.11 enables us to define a mapping 
Sx '■ X^ Wg,x from the data functions to their corresponding Exact smoothers defined by 

S%f=]^{C*xCxr'£*xSxf, f€Xl (5.12) 

By Corollary 137, C*xCx is a homeomorphism from X'^ to X'^ so Sx is a continuous linear mapping. 

Summary 174 This is a list of some of the properties of the Exact smoother mapping Sx which will be 
used in this document: 

1. Sx maps X^ onto Wg,x and null<S^ = Wqx- Also, Sx is self-adjoint but not a projection. 

2- l|5i/|L,o < ||/|L,o ^rid Wil 5^) /|L,o < ll/IL,o. / e 

3. Sy = f-p{C*xCx)-^f, feXl. 

4. S-xf = £*x {Npl + Rx,xr' £xf, f G Xl. 

N 

5. Sxf = Yl ctkR^w, where a = (a^) satisfies the matrix equation 

fe=i 

{Npl + Rx,x) ct = y, where y = £xf is the dependent data. 

Proof. We just list the references from Chapter 4: 

Part 1 Corollary 137; Part 2 Part 2 of Theorem 140; Part 3 Corollary 137; Parts 4 and 5 Use part 
3 of Theorem 138. ■ 
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5.5 Properties of the Approximate smoother problem 

It will be shown in part 1 of the next theorem that the Approximate smoothing problem has a unique 
solution, which we will call the Approximate^ smoother. 

Theorem 175 Denote the data for the Approximate smoothing problem by [X; y] and X'. Set ? = (0, y) 
so that by 5.10, Je [f] = \\^xf — '^Wy Exact smoothing functional. 

Then the Approximate smoothing problem has a unique solution Sa € Wg,x' which satisfies: 

1. Je [sa] < Je [f], for all f G Wg,x' and f ^ Sa. 

2. {Cxsa - CxSa - Cxf)v = 0, for all f e Wg,x' ■ 

3. \\CxSa - ^lly + \\CxSa " Cxffy = Uxf " ^ ||^, for all f G Wg,X' . 
Note that the equations of parts 2 and 3 are directly equivalent. 

I (c*xCxSa-ir£*xy, f) =0, for all f €Wg,x'. 

Proof. Part 1. We want to show that there is a unique function in Wa,x' which minimizes the Exact 
smoothing functional Je [■] over Wg,x'- Since Wg,x' is a finite dimensional subspace of X^, its image 
under Cx must be a finite dimensional subspace of V and hence a closed subspace of V. 

Consequently, there exists a unique element of Cx {Wg,x'), say w, which is the projection of C, onto 
Cx {Wg,x-), such that \\v - <;\\y < \\Cxf - <;\\y for all / G Wg,x' such that Cx (/) ^ v. 

Since Cx is 1-1 on X° there exists a unique element of Wg,x', call it Sa, such that v = Cx (sa)- 

In terms of Je we have Jg [sq] < Je [/] for all / G Wg,x' and f ^ Sa- 

Parts 2 and 3. Since v is the projection of onto Cx {Wg,x') simple Hilbert space geometry yields 

the equivalent equations of parts 2 and 3. 

Part 4. We stert with the equation of part 2 and use the fact from part 3 of Theorem 134 that 
CxU = pui + j^£xU2, (wi, ^2) G V. Noting that X has N data points we have for all / G Wg,x' 

= {CxSa - <.,CxSa - Cxf )v = i^XSa - ^,Cx (Sa - 

= {C*xCxSa - C*x';, Sa - f)^^o 

^*X^XSa - -^^XV, Sa- fj 

/ wfi 

But So G Wg,x' so 

(c*xCxSa-^S*xy,f) =0, feWG,X', 
V -'^ /wfi 

and thus C\CxSa — jj^xV ^ x' t'Y definition of the orthogonal complement Wq x'- ■ 
Part 4 is used to prove the next two corollaries. 

Corollary 176 Suppose Se is the Exact smoother of the data [X,y]. Suppose Sa is the Approximate 
smoother of the data [X,y] generated by the points X' . Then: 

1. Se- Sa& W^x' £x'Se = £x'Sa- 

2. Sa = Sx' i^X'^x'^ £x'Se = Ex'^X' ,X'^X' Sg = Ix'Se. 

3. Sa = jjSx'^X^X'^X' {C*xCx) ^£*xV- 

Proof. Part 1. From the proof of part 4 of the previous theorem C*^CxSa — j^^xV ^ ^g x' ®° 
C*xCxSa - jjS'xV = .9 foi' some g G W^x' a-^d - ^ {C*xCx)~^ SxV = {^*x^x)~^ g- But by equation 
5.12, Se = {C*xCxy^ £*xy and so - Se = {C*xCxy^ g- 
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Further, from part 3 of Summary 174, 8x9 = 9 — P {i^*x^x) ^ 9 and thus 

{C*xCx)~^ 9 = since null<S^ = Wq ^'- Hence Sa — Sg = ^g & Wq ^' which is characterized by 

Sx' {Se - Sa) = i.e. Sx'Se = Sx'Sa- 

Part 2. Prom Theorem 52 we know £x ■ — > Wg,x and £x> ■ — > Wg,x' are onto. Hence, since 
Sa G Wg,x' and Sg € Wg,x we have Sg = £x^ ^^'^ = ^x'(^ fo'^ some a € and /3 e . Parts 1 
and 5 of Theorem 52 now imply 







= £x' (Se - Sa) = Sx' {se - ^x'f^ = ^X'Se - Ex'^x'P = ^X'Se - Rx',X'P, 



SO that f3 = x'^x'Se and Sa = £x'(^ — ^x'-^x' x'^x'Se, as required. 

The last equality follows from the formula for the minimal norm interpolant operator Ix' given by 
equation 2.11, namely Ix'f = ^x' {Rx',x')~^ ^x'f- 

Part 3. From the proof of part 1, Se = jf {Cx'^x)~^ ^xV- Hence by part 2 



Sa = £x'Rx\x'^X'Se = £x' Rx\x'^X' j;^ {^*x^x) ^ SxU 

= jq^*x'R'}c',x'^x' {C*xCx) ^s*xy- 

■ 

The following corollary contains results which are analogous to some of those derived in Theorem 140 
for the Exact smoother. 

Corollary 177 Suppose Sa € Wg,x' is the (unique) Approximate smoother of the data [X,y] generated 
by the points X' . Then: 

1- (sa,/)^,o = w-piy- {sa)xf fx, when f e Wg,X'- 



3. Je{Sa) = jf{y- iSa)xfy- 

Proof. Part 1. Prom the definition of jO-xJO-x 

C*xCxSa - J^^*xy = PSa + ^£*x£xSa - Jj^xV = PSa + ^£x {{Sa)x ' V) ■ 

Thus, when / e Wg,x' 

= (^C*xCxSa - ^£*xy, /) ^ = {pSa, /)^,o + ^ {^*X iiSa)x ' V) , /)^_^ 

= P (Sa, /)„,o + (iSa)x " 2/> ^X f 



= P {Sa, /)^,o + {{Sa)x - y. fx)c^ 

1 J, 

= p{SaJ)^^0 + j;^iMx-y) fx- 



Part 2. If / = Sa in the equation proved in part 1, then 

= P ||Sa||^,o + - y> {Sa)x)c^ = P hAwfi + {{Sa) x " vf (Sa)x' 



SO that ||Sa|L,o = jNiy- iSa)x) (Sa)jf ■ 
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Part 3. 



P 
] 

~N 



Je{s) = p\\Sa\\l,^0 + ^\{Sa)x - y? 

= P lka||^,o + ^ - ((Sa)x " 

^ (y - (so)x)^ Mx + {Mx Mx - isa)x y - y^{sa)x + y'^y) 
= ^ (y'^Mx - Mx Mx + Mx Mx - Mx y - y'^Mx + y^y) 

= {sa)xy + y^y) 

1 j,_ 
= j^iy- Mx) y- 

■ 

In a manner analogous to the minimal norm intcrpolant mapping Tx of Definition 55 and the Exact 
smoother mapping Sx of Definition 136 which map data functions to their interpolant and smoother 
respectively, we will now define the Approximate smoother mapping: 

Definition 178 The Approximate smoother mapping Sx x' ■ -^w ^ Wg,x' 

Given an independent data set X, we shall assume that each member of can act as a legitimate 

data function fd o,nd generate the data vector Ex. fd- 

The equation of part 2 of Corollary 176 enables us to define a continuous linear mapping Sx x' '■ 

X^ Wg,x' from the data functions to the corresponding Approximate smoother of the data X, £xfd 
generated by the points X' . This mapping is given by 

^x,x'fd = -j:^^*x'^'x',x'^x' {^*x^x) ^ £*x£xfd, fd e X° . 
We now prove some properties of the Approximate smoother mapping. 

Corollary 179 Suppose Sx x' Approximate smoother mapping. Then Sx x' — ^x'S% where 

Xx' is the minimal norm interpolant mapping with independent data X' and S^ is the Exact smoother 
mapping with independent data X. 

Further, S'^ x' self-adjoint and ||5^,x'/|L^o - H-^L.o f°'^ / ^ ^w- 

Proof. That iS^ = Ix'Sj^ follows immediately from 5.12 for 5^ and equation 2.11 for Tx'- 5^ x' 
self-adjoint since Tx' and Sx are both self-adjoint. Finally, from part 2 Summary 174,||5x.x'/||^ q — 
since „ < and ||Jx^/L,o < ll/IL,o for all / G XO. ^ 

In the last corollary it was shown that the Approximate smoother is an interpolant of the Exact 
smoother. Hence the Approximate smoother uses the same information about the data function as the 
Exact smoother does i.e. it involves no additional data function evaluations. 

5. 5. 1 The matrix equation for the Approximate smoother 

We now know the Approximate smoother exists and is unique. The next step is to derive its matrix 
equation. 



Theorem 180 Suppose s G Wg,x' is the (unique) Approximate smoother of the data [X,y] generated 

join 

N' 



by the distinct points X' = {x'j}^^^. 



Then s{x) = Oi'iRx'. {x), where a' = (a^) e and a' satisfies 
i=l ' 

{NpRx',x' + Rx,x'Rx,x') a = (5-13) 
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_d _d 

where Rx,x = (27r) ^ Gx,x and Rx,x' = (27r) ^ Gx,x' ■ 

The matrix on the left of equation 5.13 will he called the Approximate smoother matrix and it will 
usually he denoted hy the symhol "i. 

Proof. This proof uses the results Ex'^x — Rx',x and Ex^x' — Rx,x' which follow from part 6 of 
Theorem 52. Prom Theorem 175, {cXLxs - w^xV^ f) =0 for all f gWgx'- Substituting / = R^' 

yields £x' (jC^Cxs — j^S^V^ — 0, and since by part 4 Theorem 134, C^JCxs = ps+ j^S^^xs, we have 

= £x' [c*xCxs - -^^*xyj = ^x' (^ps + ^£*x£xs - j^^xV 

= pSx'S + ^Rx',x£xs - ^Rx',xy 

= Npsx' + Rx',xsx - Rx',xy- (5.14) 

N' 

The next step is to write sx' and sx in terms of basis functions. But s{x) = Yl ct'iRx'. {x) = £x'^' 
implies 

Sx = Rx,x'a', SX' = Rx',x'a', (5.15) 

so that 5.14 becomes 

= NpRx',x'a' + Rx',xRx,x'Oi' - Rx',xy, 
which can be rearranged to give the desired matrix equation 5.13. ■ 

Remark 181 

1. Note that the reason the matrix equation 5.13 is more concise written in matrices hased on Rx 
than the basis function G is that I have chosen the 'symmetric' Fourier transform pair /(^) = 

(27r)"^ /e-*^«/(a;)rfa; and /(O = (27r)"^ /e*^«/(a;)rfa;. 

2. The 'hasis function form' of the Approximate smoother matrix equation 5.13 is 

((27r)5 NpGx-,x' + Gl^x'Gx,x) a' = 
with the smoother given hy 

N' 

«(^) = ^OLfi{x-x'i). 

i=l 

In the next theorem we prove some properties of the Approximate smoother matrix. 

Theorem 182 The Approximate smoother matrix \1/ specified in Theorem 180 has the following proper- 
ties: 

1. is real valued, symmetric, positive definite and regular. 

2. ^' has size N' x N'. Hence the size of^ is independent of the number of (scattered) data points. 

Proof. Part 1. Since G is a Hermitian function and is assumed to be real valued in this document, it 
follows that Rx',x' and Rx',x have real elements, Rx',x' is symmetric and so ^ is real and symmetric. 
Further, if a e 

a'^^a = {NpRx',x' + Rx,x'Rx,x') a = Npa'^ Rx',x'a + a'^ Rx,x'Rx,x'a 

= N pa^ Rx' ,x'a + \Rx,x'af' . 

But from the Introduction 5.1 Rx',x' is positive definite and so a^^a > iff a = which implies 
that ^ is positive definite over C, and since G is real we have that \& is regular. 

Part 2. From the block sizes it is clear that 5* is square with N' + 2M rows. Hence the size of ^ is 
independent of the number of (scattered) data points N. m 
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5.5.2 The scalability of the Approximate smoother 

From the basis function G, the iV independent data points X and the N' points X' , we construct the 
N' X N' matrix Rx',x' = (27r)"^ Gx',x' and the N x N' matrix Rx,x' = (27r)"2 Gx,x'- For a specified 
smoothing parameter p and independent data y, we construct the matrix equation 5.13 i.e. 

{NpRx',x' + R^x,x'Rx,x') a' = R^x,x'y- 

The system is solved for a' and the Approximate smoother is evaluated at various points Z using the 
formula sz = Rz.x'Ct'. 

Our next result shows the algorithm is scalable i.e. the time of construction and execution of the 
solution is proportional to the number of data points. This is in contrast with the Exact smoother which 
is not scalable but which has quadratic dependency on the number of data points. 

Corollary 183 The Approximate smoother algorithm is scalable. 

Proof. Suppose the evaluation cost for G (x) is ma multiplications and that N :s> N' . The construction 
cost for ^ is 

{N'f ma + N'NmG + N'N + N'Nma - 2N'NmG. 

The solution cost of an N' x N' matrix equation is ^ {N'f multiplications for a dense matrix. Thus 
the total cost is 2N'NmG + 5 {N'f which is linearly dependent on the number of data points. 

However, by the use of a basis function with support containing only several points in X' e.g. the hat 
function, the construction and solution costs can be reduced significantly. However we still have linear 
dependency on N. ■ 



5.6 Convergence of the Approximate smoother to the Exact smoother 

In this Section we will prove several results concerning the convergence of the Exact smoother to the 
Approximate smoother. We will start by proving some general results which only assume that the weight 
function has property W2. These results yield no convergence orders (hence called order-less) but show 
that the Exact smoother always converges to the Approximate smoother uniformly pointwise and norm- 
wise as the density of the set X' increases. They will be applicable to Approximate smoothers generated 
by regular rectangular grids and sparse grids. 

We then derive pointwise order of convergence results which are analogous to the results derived for 
the minimal norm interpolant in Subsections 2.5.2 and 2.5.3, and for the Exact smoother in Sections 4.6 
and 4.7. Type 1 and Type 2 results are first derived for the case where no assumption is made regarding 
the unisolvency (Definition 66) of the independent data. Then it is shown that if the data is unisolvent 
of order m then the order of convergence of the smoother is also m. 

5.6.1 Order-less convergence results 

The results of this subsection only assume that the weight function has property W2 so that the data 
functions are continuous. Uniform pointwise convergence results are derived but no order of con- 
vergence results are obtained. To start with we will need a definition of convergence for sequences of 
independent data sets. 

Definition 184 Convergence of independent data sets 

( 

1. A sequence of independent data X„ = I said to converge to the independent data X = 

i^^^^)i=i' denoted Xn X , if there exists an integer K such that N^ = N when n > K, and for 



each i, 



as n ^ 00. 



2. When k > K, Xn and X can be regarded as members of R^''- and convergence as convergence in 
M.'^'^ under the Euclidean norm. 
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Theorem 185 Suppose Sg is the Exact smoother generated by the data [X;y]. Suppose X!^ is a se- 
quence of independent data sets which converge to X in the sense of Definition 184 o-'nd that s^a^ is the 
Approximate smoother generated by X'^ and [X\y]. 

Then the Approximate smoothers satisfy Jg si"^ Je [se] as n —> oo, where 



Je[f\=p\\f\\l,o + ^\fx-y\'. 



is the Exact smoother functional 4-1- 



Proof. W'c first note that the definition of the convergence of independent data sets aUows us to assume 
that the X'^ have the same number of points as X. 

Now suppose X' is an arbitrary independent data set with the same number of points as X, and 
Sa = Sa {X') is the corresponding Approximate smoother. If it can be sliown that as a function of X' , 
Je [sa {X')] is continuous everywhere the the theorem holds since Je [sg] = Je [sa {X)]. 

In fact, by Theorem 180 

N 

Sa {X') {x)=Y,Ri.X-<)a'i, 
i=l 

where a' = {a'^) satisfies the matrix equation 

^a' = Rx',xy, 

and 

* = NpRx',x' + R^^x'Rx.x'- 
Starting with part 3 of Corollary 177 and noting that {a')^ = ('S>~^Rx',xy)^ = y^Rx,x'^ ^ we have 

Je K {X')] = ^{y-Sa {X')^fy= 1 \yt - ls„ {X')ly 

= \y? - ^ {Rx,x'a'fy 

= l\y\'-^ic^'f{Rx,x'fy 
= \yf - j^y^Rx,x''b~^Rx',xy- 

If we can show that Rx,x' and are continuous functions of X' in a neighborhood of X, then we 
have Je [sa {X')] is a continuous function of X' . But Rx',x' and Rx,x' are clearly continuous for all X' 
so '^x' and dcf^x' arc contirnioiis for all X' . Further, since ^f^' is positive definite and regular for all 
X', det > for all X' and it is clear from Crammer's rule that is continuous everywhere. Thus 
Je [sa {X')] is continuous everywhere and the proof is complete. ■ 

The next corollary shows that the Approximate smoother converges to the Exact smoother, both 
pointwise and norm-wise, as the grid size goes to zero. 

Corollary 186 Suppose Sg is the Exact smoother generated by the da,ta, [X; y] and that we have a se- 
quence of regular grids X'^ with a common rectangular grid boundary and grid sizes hn- Suppose X lies in 
the interior of the common grid boundary. Also, let Sa {X!^) denote the Approximate smoother generated 
by X'„ and [X;y]. 

Then |/i„| implies \\sa (X!^) — Se||^ o ^ *-* '^^'^ ll'*" i-^n) ~ ■^elloo ~^ ^> where the supremum norm 
||-||^ is defined on M**. 

Proof. For each data point x^'^' G X there exists a sequence of distinct points (^^n^^ such that 
zL^'^ e a:; and z'a'^ xC^) in R'' as n -> oo. 

Set Zn — I '^"^ I and let Sa (Zn) be the Approximate smoother generated by Z^. Then Z^ — > X 
as independent data and by Theorem 185, Je [sa (Zn)] Je [se]- 
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But since Z„ C X^, we have Je[sa{X^)] < Je[sa{Zn)]. Also from the definition of Sg we have 

Je[Se]<Je[Sa TO]- ThuS 

Je [Se] < Je [Sa [X'^)] < Je [Sa [Zn)] , 

and so [sa {X'^)] Jg [sg]- To establish the convergence of Sa (X!^) we use part 1 of Theorem 4.12 
which can be written 



1 ^ 

Je [Se] + P \\Se ' f\\l,o + ^ K i'^^'^) ^ 



fe=l 



Je [f] 



for all / e X". Choosing / = Sa (X^) wc sec that Us,. - Sa (X,'J||^^o 
Finally, if is the Riesz representer of the functional f ^ f {x) 



0. 



\Se (x) - Sa {X'J {x)\ = (Sg - Sa {X'J , Rx)^^o ^ ll^e - Sa {X'^ 

= \\Se-Sa (X;)||^_o /MO), 

with the right side independent of x. ■ 

We can remove the constraint that the sets X!^ are regular, regular grids: 

Corollary 187 Suppose fl is a bounded, open connected set. Suppose Se is the Exact smoother generated 

by the data [X:y] and that X C ^l. Suppose also that we have a sequence of sets X^ C such that 
maxdist (x, X'J 0. Finally let Sa (X^'J denote the Approximate smoother generated by X'^ and [X] y\. 

Then \\sa {X'^) — Se\\^ o ^ '^'^'^ W^a {X'^) — Selloo ~^ where the supremum norm \\-\\^ is defined on 



Proof. Let X = {x^^^^^^^ Since X c O the assumption that maxdist {x,X'^) — » implies 
dist {x^^\X'^ and we can now use the arguments of the proof of the previous corollary. 



5.6.2 General error results 

The results of this subsection only assume that the weight function has property W2. The next result es- 
tablishes some upper bounds for the pointwise difference between the Exact and Approximate smoothers 
which are uniform on W^. Here p is called the smoothing coefficient and is used to define the Exact 
smoother functional. No data densities are involved. 



Theorem 188 If Se is the Exact smoother of the data X;£xfd 



and Sa is the Approximate smoother 



generated by X;£xfd 



and 



and X' then 



\Sg{x)-Sa {X)\ < \\Se\LoVRo{0), X€ 



,|Se|L_0 ^ ll/rflL,0- 

Proof. From Corollary 176 we have Sa = Ix'Se so that 



(5.16) 
(5.17) 



\Se{x) - Sa {x)\ < \Se {x) - {Ix'Sg) {x)\ = {Sg - Ix' Se, Rx)yjfi 



< II Se -Ix'S,, ' 



= S, 



eWwfi 



V^o (0), 



where the last equality follows directly from the definition of R^- Substituting fa = Ry into 5.16 we 



get 



\S^xRy i^) - SXx'Ry {^)\ < ll^^^ylL n V Ro (0), x,y e 
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From equation 4.15, ||<Si/||^ „ < /MO)min{l, so that 

WSIcRyll^, < ||J?,|L,o l|i?olL,omin{l, ^1 = Ro (0)min|l, ^ 

and hence 

\S3,Ry (x) - S^,x'Ry {x) I < i?o (0) min |l, 
That ||se||^ Q < ||/d||^ follows from part 2 of Summary 174. ■ 

5.6.3 Error estimates derived without assuming unisolvent data sets 

We will now derive order estimates for the pointwise difference between the Approximate smoother and 
the Exact smoother. The derivation of these estimates will rely strongly on the fact that the Approximate 
smoother is the interpolant of the Exact smoother on the set X' (see 5.3) and so we will use the 
convergence results for the minimal norm interpolant derived in Chapter 4. The convergence results for 
the interpolant and Exact smoother all involved one of three assumptions concerning the unisolvency of 
the independent data set X and the weight function w and its parameter k: 

1. w has property W2 for some k > with no assumption that X is unisolvent; 

2. k; > 1 with no assumption that X is unisolvent; 

3. «; > 1 and X is unisolvent. 

We will consider the first two cases in this subsection. 

Type 1 estimates (k > 0) 

The next result is an estimate of the difference between the Exact and Approximate smoothers for an 
arbitrary data function in X° . 

Theorem 189 Suppose: 

1. The weight function w has property W2 and that G is the basis function generated by w. Assume 
that for some s, Cg, ho > the basis function satisfies 

\G{0)-G{x)\<Cg\x\^\ \x\<hG. (5.18) 

2. Let Se be the Exact smoother generated by the data X;£xfd 

3. Suppose that Sa is the Approximate smoother generated by the data X;£xfd dnd the points X' 
contained in K, where K is a closed, bounded, infinite set. 

J;.. LetTx' be the minimal norm interpolant operator on the set X' . 



Then if Ug = (27r) ^ w^^Cg we have the error bound 

\Se (x) - Saix)\< ^ {Se - Sa, Se)^^o^G (/iX'.x)* , X € K, (5.19) 



when hx',K = maxdist {x, X') < Kg- 



Further, -^{se — Sa, Se)^,o — ll*elL,o — ll/<ilL,o '^'^^ order of convergence is at least s in hx',K- 

Proof. Prom Corollary 179 we have Sa = Ix'Se- Now we can apply Theorem 59 which estimates the 
error of the minimal norm interpolant of an arbitrary data function. In this case the data function is Sg, 
the data points are X' , the data region is K, and so for x G if 



\Se (x) - Sa {x)\ = \Se (x) - (Jjf'Se) (a:) | < {Se - 1x' Se, Se)^^QkG {hx^KY 

= \J («e - Sa, Se)^^o^G {hx',Ky , 
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when hx',K < he- Finally, from part 2 of Summary 174, / — Ix' and Sx are contractions and so 

■\/(^e^~^ix^^ej'seX(^ < ^||Se — ^X'Se\\yjfi ||Se||,„,o ^ lke|L,0 — ll/d|lu;,0 • 



Remark 190 

1. Unlike the error estimates for the Approximate smoother and the minimal norm interpolant, this 
estimate for \se {x) — Sa {x)\ does not require the independent data set X to he in a closed, hounded, 
infinite set. Instead, it is X' which is required to he in such a set. The density of X is not explicitly 
involved in the estimate i.e. there is no term involving hx,K o,nd the dependency on X is in the 



formula for Sg and can thus be eliminated hy the approximation y{se — ^x'Sg, Se).^, q < o' 

2. This result confirms the convergence result of Corollary 186. Unlike the Approximate sm,oother 
error estimates, this estimate does not explicitly involve the smoothing coefficient p and has the 
same form as the interpolation error estimates. However, the smooUiing coclficient is part of the 

formula for Sg and can be eliminated hy the approximation yj{se — Ix'Se, Se)^,o — ll/<ilL,o- 

3. The following argument shows the factor ^ [s^ — Sa, Se)^ q in the estimate 5.19 can he calcu- 
lated numerically. Indeed, since Sa = Ix'Se we have ise — Sa,Sa)^Q = {se — Sa,Txi Se)^ q = 

{Ix'Se-Ix'Sa,Se)^^0 = SO that (Se, Sa)„_o = W^^Wlifi "'^'^ 

(se ~ Sa,Se)^Q = y^||se|j^ q — ||sa||^ q. Part 2 of Corollary 177 and part 3 of Theorem 145 can 
then he used to calculate || Sa || ^ q '^'^'^ PelL o respectively. 

As with the Exact smoother error estimates which were valid for arbitrary data functions in X° we 
can try to improve the last convergence result for data functions which have the form Ry, y G K. 

Theorem 191 Suppose we have the same assumptions and notation as Theorem 189 except that now the 
data function is specialized to Ry. Hence SxRy is the Exact smoother and Sx x'^y Approximate 

smoother. 

Further, suppose that the data region K that contains X is a hounded, closed and infinite set. 
Then if hx,K < ho and hx',K < ha we have the estimate 



SxRy {x) - Sx^x'Ry {x)\ < ko {hx',K y[y/^ + kG{hx,Ky + kG{hx',Ky), x,yeK, (5.20) 



where kc = (27r) * V^Cq and Cq satisfies 5.18. 
Proof. Theorem 189 with = Ry yields the estimate 

\S^xRy {x) - S^x'Rv I < kG^{S%Ry - Ix'S-xRy,S-xRy)^^o if^x',Ky , (5.21) 



when hx' ,K < he and x £ K. The term ^ {SxRy — ^x'SxRy,SxRy)^ q will now be manipulated in 
order to try to improve the convergence. Since Xx' is a self-adjoint projection we have 

{SxRy - Ix'SxRy,SxRy)^^o = {{I -Ix')SxRy,SxRy)^j^ (5.22) 

= {{I -^X')SxRy,Ry - {I - Sx) Ry),^ f^ 

= {{I -1x')SxRy;Ry).^„i-, - {{I - Ix') SxRy,{I - Sx) Ry)^^Q 

= {{I - Ix')S'xRy) {y) - {{I - S'x) (I - Ix')S'xRy, Ry)^^, 

= {{I - Ix')S'xRy) {y) - {{I - S^x) {I - ^x') S'xRy) (y) 

<!((/- Tx')SlRy) {y)\ + \{{I-Sl){I- Ix') S^xRy) (y)l • (5-23) 
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The application of the Type 1 interpolation convergence estimate 3.55 to the data function SxRy gives 

\{{I-Ix')S$,Ry){y)\ < {{I -Ix')S$,Ry,S'Mn,fikG{hx',Kr , (5.24) 

when hx',K < ha and y G K. Further, for an arbitrary data function fa e X^, the Exact smoother 
estimate 4.22 implies 

|((7 - S'x) fd) {x)\ < ^{fd-S^xUfd)^^o (^^^ + ''G {hx,Kr) , xeK, 
when hx,K < ha- But in this case fci = {I — Ix')SxRy where x,y & K, and so 

|((/ - S'x) {I - Ix')S'M < yJUd-SyaJd)^^, [VpN + kG {hx,Kr) , 

and 

yjifd - Sxfd,fd)^^Q < ^JIfZfdX^ = {{I - Ix')S^Ry,S'^Ry)y,^o, 

so that 

|((7 - S'x) {I - Ix')S\Ry) {x)\ < ^{{I-Ix')S3,Ry,S-M^,oX 

X + kc {hx,Ky) . (5.25) 

Next using 5.24 and 5.25 to estimate the right side of 5.23 we obtain 

{{I-Ix')S'xRy,S3cRy)^,o < ^{{I -^x-)S\Ry,S-M^fi^G {hx',Kr + 

+ ^J{{I-^x')SS,Ry,S^M^,o {Vp^ + {hx,Kr) , 

so that 

^{{I-Ix')Sj,Ry,SI,Ry)^^„ < + kc ihx,KT + ka ihx',KT , 
and as a consequence of equation 5.22, inequality 5.21 implies 

\SI,Ry {x) - S^x'Ry < kc {hx',KY ^{{I -Ix')S-xR^,S%Ry)^ ,, 

< kc {hx',Ky (ypN + ko {hx,KT + kc {hx',Ky) , 

as claimed. ■ 

Remark 192 Our attem.pt at improving convergence has failed and the order of convergence in hx\K 
is still s. Unlike the cases of the interpolation error in Subsection 2.5.2 and the Exact smoother error of 
Section 4-6 this attempt to improve the estimate for the order of convergence of the Approximate smoother 
to the Exact smoother has resulted in new terms appearing. Essentially the original term kc ihx,KY has 
been squared but the term kc {hx',KY {VpN + kc {hx,KY) has been added. Hence, to get the benefit of 
squaring, the term \/pN + kc {hx,KY should be the much the same size as ka {hx',KY ■ 

The next result gives some idea of how the Exact and Approximate smoothers compare when the 
smoothing parameter p is large. 

Theorem 193 Suppose the assumptions and notation of Theorem 189 hold so that Sg is the Exact 
smoother and is the Approximate smoother of the data function fd- Then for p > 

|Se (X) - Sa (X)| < {Se - fd, fd)^,o {kaf ^ , X G if, /rf G X^, (5.26) 

with J (se - fd, fd)^fi < ||/dlL,o- -^^'^ 5«^'en P the order of convergence is 2s. 
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Proof. From part 4 of Summary 174 



Se = £*x{NpI + Rx,xr^£xfd, 

so that 

(Se - Sa,Se)^o = " Sa,£x {N pi + Rx,xT^ £x fd) 

= (Sx {Se - Sa) , {Npl + Rx,x)~^ £x fd) 
< \£x (Se - Sa)\ \{NpI + Rx,x)~^ £xfd 

But from part 2 Lemma 142 

||Se - /d||^,0 = {Se- fd,fd)^fi- pN {pNI + Rx,x)~^£xfd 



so that 



{pNI + Rx,xr^£xfd\ < {pNr'^^^{se-fd,fd)^,o- 



Hence, if |se - s^l = max \se {x) - Sa {x) 



(Se - •Sa,Se)^,o ^ ( 



{Se - fd, fd)^n {pN} 



-1/2 



<VN\Se- SaL,K ^J (Se - /d, fd)u,fi (P^) 
= \Se — Sa|(x),K ^ (Se — fd, fd) 



and by 5.19 
which impHes 



\Se - Saloo.if < ^ l^e " SaL.K ^ (^e " fd, fd)^^oP^^^HkG {hx',K)') 
= - Sa\oo,K\l{Se - fd, fd)^fiP~^^^kG {hx',K)' , 



and hence 



\Se - Sa|oo,X < J (Se - fd, fd)^,^ (kc) 



2 {hx',Ky 



VP ' 



which impHes 5.26. 



(5.27) 



RemEirk 194 Theorem 193 suggests that as the smoothing parameter p increases the Approximate 

smoother converges to the Exact smoother. Further, for any 'large ' value of p the rate of convergence in 
hx',K is at least {hx^x)^" ■ Of course, as the smoothing parameter increases the Exact smoother gets 
further from the data function. 

The next result summarizes the estimates of |se {x) — Sa {x)\ and \SxRy {x) — Sx x'-^y {^)\ have 
derived above. 

CoroUciry 195 For an arbitrary data function fd we have the error estimates 

ll/dL.o kc [hx'^xY , 



\Se{x) - Sa{x)\<umi\ WfdWwfl V (0), 



X G K, 



(5.28) 
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and for the Riesz data functions Ry we have the error estimates: 



{ ka {hx',KY + ka {hx,Ky + ka {hx'^Kf) , 

Ro (0)min{l,^}, 



\SxRy (x) - S% x'^v (^)| ^ ' 



ike) 



2 (fex'.g) 
VP 



(5.29) 



{kaf %^^P.IL,;. + i?o(0)min{l,^}. 



for x,y ^ K. 



Proof. The inequalities 5.28 are the estimates 5.16, 5.19 and 5.26 simphfied using the inequahties 

^{Se - Sa,Se)^^o ^ ll./rflL,0 ^nd ^ [s^ - fdjd)^fl < IL/dL^O' 

Regarding the inequahties 5.29: The first inequahty is 5.20 and the second inequality is 5.17. When 
fd = Ry is substituted into 5.26 we get fov x G K 



\S^xRy (x) - S^x,x'Ry < {kef ^^""'p ,Jss,Ry{x)-Ry{x), 



and then substituting the estimates 5.42 for SxRy {x) — Ry (x) immediately gives the remaining three 
inequalities. ■ 



Type 1 examples 

The weight function examples used here are those used in the interpolation and Exact smoother doc- 
uments i.e. the radial shifted thin-plate splines, Gaussian and Sobolev splines, and the tensor product 
extended B-spline weight ftinctions, augmented by the tensor product central difference weight functions 
from Chapter 3. 

When the smoothing coefficient p is zero the error estimates of this subsection are the Type 1, non- 
unisolvent, interpolation estimates of Subsubsection 2.5.2 and consequently the formulas or values for 
SjCchcka have already been calculated and were given in Table 2.1. We give these values here in 
Table 5.1 together with the Type 1 values from Table 3.1 for the central difference tensor product weight 
functions. 





Smootlior couv or.^xHict^ ore 


lor (>siimaU>ri 




Type 1 non-unisolvent estimates: k > 0. ka = (27r) ''^^ \/2Cg- 




Parameter 


Converg. 






Weight function 


constraints 


order s 


Cg 


he 


Sobolev splines 


v-i = i 


1 




oo 


2 


2"-ir(i>) 


{v > d/2) 




1 










Shifted thin-plate 




1 


eq. (5.30) 


oo 


{-d/2 <v<0) 










Gaussian 




1 


2e-3/2 


oo 


Extended B-spline 




1 

2 


Gi {Qt-'\\DG,\\^^/dW 


oo 


Central difference 




1 

2 


Gi {Qt-'\\DG,\\^V~d^^) 


oo 


Gi is the univariate basis function used to form the tensor product 


Ky is the modified Bessel function and Ky (r) = r^Ky (r). 



TABLE 5.1. 



\ — 2v 

Cg = IC^rf" + f) (w)| , where f (r) = (1 + rf and w = • (5.30) 
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Type 2 estimates (k > 1) 

These results can be proved by first using the theorems of Subsection 4.6.3 to obtain estimates of the 
form \G (0) — G{x)\ < Co \xf^ and then employing Corollary 195. 

Theorem 196 Suppose a weight function satisfies property W2 for some k > 1 and denote the basis 
function by G. Then the smoother error estimates of Corollary 195 hold for 

CG = -^(v2G)(0)d, s = l, /iG = 00, fcG = (27r)-^ ^-{W^G) (0)Vd. (5.31) 

However, if the weight function is radial we can use the estimates of: 

Theorem 197 Suppose a radial weight function satisfies property W2 for k = 1 and denote the (radial) 
basis function by G. Set r = \x\. Then: 

1. If G {x) = / (r^) the smoother error estimates of Corollary 195 hold for 

CG = -f'{0)d^ s = l, hG = ^, fcG = (27r)- V-2/' (OK 

2. If G{x) = g{r) then the smoother error estimates of Corollary 195 hold for 

CG = -\g" {Q)d^, s = l, /iG = oo, kc = {2tt)--^ ^-g" (0)d. (5.32) 

If the weight function is a tensor product the following result will be useful: 

Theorem 198 Suppose a tensor product weight function satisfies property W2 for /t = 1 and denote the 
univariate basis function by Gi. Then the smoother error estimates of Corollary 195 hold when 

CG = ~Gi{0f-'D^G,{0), s = l, hG = oo, kG = {27r)-i sj -G^ {of-'D^G^ {0)Vd. 

Type 2 examples 

When the smoothing coefficient is zero the smoother error estimates of the Theorems of the previous 
subsubsection become algebraically identical to the Type 2 interpolant error estimates of Chapters 2 and 
3. Further, the weight function examples used above were also used for the Type 2 interpolation examples 
of Chapter 2. If we use the 'radial' Theorem 197 to do the estimates for the radial basis functions and 
Theorem 196 to do the estimates for the tensor product basis functions then the values for s, Cq, he, 
kG will match those obtained for the interpolants. These are given below in Table 5.2 which is Table 2.2 
augmented by the results for the central difference tensor product weight functions from Table 3.1. 

When p = we see that the order of convergence is at least 1 for an arbitrary data function and at 
least 2 for a Ricsz representor data function, no matter what value k takes. However, in the next section 
we will show that by assuming the independent data is unisolvent of order k > 1 it follows that an order 
of convergence of at least [k\ can be attained for an arbitrary data function and an order of convergence 
of [2kJ for a Riesz representer data function. 

5.6.4 Error estimates using unisolvent data sets 

Unisolvent sets were introduced in Subsection 2.5.3 where they were used to define the Lagrange inter- 
polation operators V and Q — I — V. Lemma 72 supplied the key interpolation theory and I reproduce 
it here: 

Lemma 199 ( Copy of Lemma 72)Suppose first that: 

1. Cl is a bounded, open, connected subset o/M*^ having the cone property. 

2. X is a unisolvent subset of of order k. 
Now define 

hx,n = sup dist (w, X) , 
wen 
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Smoother convergence order estimates 


Type 2 non-unisolvent estimates assuming k> 1 


Weight function 


Parameter 
constraints 


Converg. 
order 


\d/4: 1 / 
(27r) ^ kcHd 


Sobolev sphnes 
[v > d/2) 


v-l>2 


1 


lT{v-d/2-l) 

V 2<i/2+ir(«) 


Kv-^ <2 


1 


/ r(t,-d/2-i) 

V 22"-d/2-3r(v) 


Shifted thin-plate 
{-d/2 <v<0) 


■ 


1 


^J-2v 






1 


\ — 


Extended B-spUne 
(1 < n < 


n > 2 


1 


^-Gi {Qf-^D^G^ (0) 


Central difference 
(1 < n < 


n > 2 


1 


^-Gi {Qf-^D^G^ (0) 


Gi is the univariate basis function used to form the tensor product. 



TABLE 5.2. 

and fix X e X. By using Lagrange polynomial interpolation techniques it can he shown there are 
constants cn,K,/io,K > such that when hx,n < ^n,K there exists a minimal unisolvent set A c X 
satisfying 

diam {A U {x}) < cn.s/ix.n- 

Further, suppose {Ij}^^ is the cardinal basis of P,^ with respect to a minimal unisolvent subset of VI. 
Again, using Lagrange interpolation techniques, it can be shown there exists a constant Kq ^ > such 
that 

M 

i=i 

for all X G ft and all minimal unisolvent subsets offl. 

To derive our estimates for \se {x) — Sa {x)\ we will rely on the formula Sa = Ix'Se of part 2 Corollary 
176. Hence we will also need the minimal norm interpolation convergence results of Subsection 2.5.3. We 
are now ready to state our order of convergence result for an arbitrary data function in X^ : 

Theorem 200 Suppose: 

1. w is a weight function with property W2 for parameter k > 1 and let G he the basis function 
generated by w. Set m= [kJ . 



2. Se is the Exact smoother generated by the data 



X;£xfd for some data function fa € X^. 



3. Sa is the Approximate smoother generated by 



X : Exfd 



and the set X' C fi. 



4- Tx' is the minimal norm interpolant operator on the set X' . 

5. We use the notation and assumptions of Lemma 199 which means that X' is n-unisolvent and 
is a hounded, open, connected set whose boundary satisfies the cone condition. 



(27r) 



a/2 
■372 



{cq.,kY^ K'q^^ max |£)^^G(0)| and hn,m such that 



•|/3|=m 



Then there exist positive constants ka 

\Se {x) - Sa (x)| < ^ {Se - Sa, Se)^ q (/iX'.n)™ , X € O, 

when hx',Q = sup dist {lj' ,X') < ho- 

The constants cn,m, K'^ ^^ and hn^m only depend on fl,m and d. Further, 

^{se - Sa,Se)^ < ||se||^_o ^ ll/rflL,o ^'^'^^'^ of Convergence is at least [k\ in hx',n- 



(5.33) 
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Proof. From part 2 Corollary 176, Sa = Ix'Se and by the interpolation estimate 2.50 there exists a 
constant ka such that 



|Se {x) - {Ix'Se) {x)\ < ^(Sg - Xjf/Se, Se)^_o^G (/iJC'.o)'" , X G ^, (5.34) 

when hx',n < hn,m- From Theorem 189, ^{se - X^'Se, Se)^,o ^ ll*elL,o ^ ll/dlL,o o'^'l^'^ 
of convergence is at least m. ■ 

Remark 201 The estimate derived in the last theorem has its dependency on the data points X contained 



in Be in the expression y (sg — Sa, Se)yj q- 

We now try to derive an improved convergence result from the last theorem for a data function which 

has the form Ry. 

Theorem 202 Suppose we have the same assumptions and notation as Theorem 200 except that now 
the data function is specialized to Ry, so that SxRy is the Exact smoother of Ry and Sx x'^y 
Approximate smoother of Ry . 

Further, suppose X is a unisolvent subset of Q. of order k, and suppose that the data region fl that 
contains X is a bounded, open, connected set which satisfies the cone condition. Set m= [/tj . 

Then if hx,K < /iQ,m o-nd hx',K < /iQ,m we have the estimate 

\S'xRy (x) - S^^x'Ry < ka {hx',nr + (/ix,a)™ + ka (/ix'.o)'") , (5.35) 

for all X, y G O, where kc = ,f\i% (co.k)" i^o ^ niax iD'^fG (0)1 . 

y^^' ' \0\=m 

The order of convergence is at least m in hx',n- 
Proof. Theorem 200 with data function Ry implies 



\SxRy (x) - S^^x'Ry (x) I < ka^ {s^^Ry - 5« {hx',nr , (5-36) 



for X € when hx'si ^ 



The term . [S^xRy - S^. x-Ry^S^xRy] will now be manipulated to try to improve the convergence. 

y \ ' / wfi 

Indeed, from inequality 5.23 

{SxRy - Sx^x'Ry'^xRy)^^o = ii^ ^ ^x') SxRy, SxRy)^ ^ (5.37) 

< m-Jx')s^xRy)iy)\ + 

+ !((/- 5i) (/ - Ix')S3,Ry) {y)\ , (5.38) 
and using 2.50 we estimate the interpolation error to be 

\{{I-Xx')SScRy){y)\ < ^{{I-Ix')S$,Ry,S^xRy)wfi {hx',nr , (5.39) 

when y Gfl and hx',Q < ha^m, and using 4.31 we estimate the Exact smoothing error for an arbitrary 
data function fd £ X^ to be 

\fd {y) - {S'xfd) {y)\ < ^J{fd-S^xfd^fd)^fi (K'^^,^ + kG {hx,ar) , yen. 

Consequently, when fd = [I — Ix')SxRy 

|((/ -SI,) {I- Ix')S'xRy) {y)\ < ^{fa-SyaJd)^^, (K'^^^^+kc {hx,ar) , 

and 

yJ{jd--sijdJdy^Q < {fd, fd)^fi = ll/dlL,o = W -^x')SxRy\\y,^Q 

= -'^X')Sl,Ry,Sl,Ry)^ Q, 
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so that 

|((/ - <Si) (/ - Ix') SScRy) {y)\ < ^{{I-Ix')Sj,Ry,SI,Ry)^^, (k'^,,^ + ka {hx^n)"') ■ (5.40) 
Next, using 5.39 and 5.40 to estimate the right side of 5.38 we obtain 

((7 - Ix')SScRy,S3,Ry)^^, < ^i{I-Ix')S3cRy,S3cRy)^,o {^k^VpN + kc (/ix.n)") + 



+ ^{{1 -Ix')S^^Ry,S^M^,okG {hx-,nr 

for J/ e O so that 



^{{I-Xx')S-^Ry,S-M^fi < K'n,.VpN + ka {hx^nT + kc (/ix'.n)" , 
and as a consequence of equation 5.37, inequahty 5.36 impHes 



|se (y) - sa {y)\ < ka 

= ka {hx',nr ^J{{I-Ix')S1,Ry,S'^M^fi 

< kc {hx',nr {K'a,.VpN + kG {hx,nr + kc {hx',nr) , 

as claimed. ■ 

Remark 203 This is the same order of convergence in hx',n that was obtained in Theorem 200 for an 
arbitrary data function, namely m= [kJ . 

5.7 Type 1 Exact smoother error estimates {k, > 0) 

In order to estimate the pointwise error of the Approximate smoother we will need the following Type 
1 error estimates of the Exact smoother. 

Theorem 204 (Copy of Theorem 155) Suppose K is a bounded, closed, infinite set and there exist 
constants Co, s, ho > such that 

G{0)-G {x) < Cg \x\^' , \x\ <hG,xe K. 



j_ 

Set kc = (27r) * \J2Cg- Then the Exact smoother S^fd satisfies the error estimates 
\fdix)-{S'xfd){x)\<mm< 



i \\ML,oi^ + kG{hx,Kr), 

ll/dlL.o (0)' (5.41) 
, ll/rf|loo,K + ll/<ilL,ov/Wmin{l,«f)}, 



and S^Ry satisfies the double order convergence estimate 
\Ry {x) - {S^xRy) < min < 



{^+kG{hx,Ky) , 

Ro (0) , (5.42) 
. P.IL,;, + i2o(0)min{l,«fl}, 



for x,y G K, when hx,K = max dist {s,X) < Kg- 



Here Rq (0) = (27r) ^ G (0) and X is an independent data set contained in K. 

5.8 Type 2 Exact smoother error estimates > 1) 



These will just be the Type 1 estimates of the previous section but with the constants from the most 
appropriate result given in Subsection 4.6.3. 
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5.9 Convergence of the Approximate smoother to the data function 



In Section 5.6 several estimates were derived for the difference between the Approximate smoother .s^ and 
the Exact smoother Sg and a copy of the estimates for the Exact smoother error obtained in Chapter 4 was 
presented. In this subsection we will combine these estimates by means of the simple triangle inequality: 
\.fd (x) — Sa < \ fd {x) — Se + \se {x) — Sa {x)\, whcrc fd IS St data fimction. The convergence results 
of Subsection 5.6.1 do not involve orders of convergence so they will not be used. 

5.9.1 Order-less convergence estimates 

The following result is a simple consequence of Corollary 187: 

Corollary 205 Suppose: 

1. fl is a bounded, open, connected set and fd € is a data function. 



2. si''^ is a sequence of Exact smoothers generated by the data Xk;£xkfd 

3. si''^ converges uniformly pointwise to fd onQ. 



with Xk C fl. 



4- si*^'' is a sequence of Approximate smoothers generated by X',^ c fl and [Xk; fii{Xk)] with the X'f. 
satisfying maxdist {x,X'u) — > 0. 

Then Sa converges uniformly pointwise to fd on 



5.9.2 Error estimates without assuming unisolvent data sets 
Type 1 estimates (k > 0) 

The next result gives estimates for an arbitrary data function and a Riesz data function Ry. 
Theorem 206 Suppose: 

1. the weight function w has property W2 for k = and that G is the basis function generated by w. 
Assume that for some < s < 1 and Cg, Hq > the basis function satisfies 



G (0) - G {x) < Cg \x 



2s 



\x\ < hG- 



(5.43) 



2. Let Se be the Exact smoother generated by the data 



X;£xfd for some data function fd S X^. 
The independent data X is contained in a data region K which is a closed, bounded, infinite set. 

3. Suppose that Sa is the Approximate smoother generated by the data X;£xfd o,nd the points 
X' c K. 

_ d 

Then if kc = (Stt) ^ v2Cg we have the estimates 



r \\fd\L,o {VpN+kG {hx,Kr + kG {hx',Kr) , 

2||/d|Lo v/^Mo), 



\fd{x)-Sa {x)\ <min< 



(5.44) 



+ 



ll/c^lL,oV^min{l,«fi}. 



and 



kG {hx',KT {Vp^ + kG {hx,KY + kG {hx',KY) , 
\Ry (x) - Sl^,Ry (x) I < r?e (p) + min <| (0) min {l, , 

ikaf 



(5.45) 



■y/Ve (p)- 
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where 

( {VpN + kG{hx,Kr)\ 

r]e (p) = min < Ro (0) , 

i IKIU,K+i?o(0)min{l,M}. 

when hx K = maxdist {x, X) < ha and hx' k = maxdist (a;, X') < ha- 

Proof. From Theorem 204 and Corollary 195, 

l/dlL.o i^pN + ka {hx,KY), 

|/d(a;)-Se(a;)| <min<j ll/<iL,o V^o (0), 

oo,x + ll/<ilL.o\^MO)niin{l,^} 



(5.46) 



and 



|se (x) - Sa {x)\ < mm < 



f \\ML,okG{hx',Kr, 
ll/dlL,0 



so that 



\fd (x) - Sa {x)\ < \ fd (x) - Se {x) \ + \Se (x) - Sa {x)\ 

{ ll/dL {V pN + k G {hx,KY + kc {hx'^KT) : 

-""'"^ ll/.lloo.K + ll/<^IL.o(M'^^^^+ 

+ ll/<^IL.o^/^min{l,iM2)|. 



Using T] (p) the estimate 5.29 for |«S^-Rj/ (x) — x'^y (^) | t)e written 



\SxRv {x) — Sx x'Ry ix)\ < min < 



f ka {hx',Ky {y/pN + ka {hx^KY + ko {hx',KY) 
i?o(0)min{l,M}, 

{kcf^-^^^^i^ + kdhx^KY), 



(ka) 



o(0)min{l,iMo)}. 



< min 



2 {h^,^,y 
(kcY %^ 

kc {hx',KY {VpN + fee {hx,KY + ka {hx',KY) 
i?o (0)min{l,^}, 



Now 



\Ry (x) - S''x,x'Ry {x)\ < \Ry (x) - {S'xRy) {x)\ + (x) - S'^x,x'Ry (x) 



and since the estimate for \Ry (x) — {S^Ry) {x) \ from 5.42 can be written \Ry (x) — {S^Ry) (x)| < 77 (p) 
wc have 5.45. ■ 

Remark 207 Define ha = {{hx,KY + {hx'^KYf'"- Then: 
1. Prom 5.44 

\fd{x)-Saix)\ < \\fd\L,o{VpN + kG{haY), X G K, 

and the order of convergence is s in ha- 
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2. From 5.45: for x,y € K 

\Ry (x) - Sl^x'Ry < (ypN + ka {hx,Ky)' + \^kG {hx',KT + 

+ {kef {hx',Ky {hx',Kr + {kef {hx,Kf' 
< + ko {hx,Ky + kc {hx',KYy 

and the order of convergence is 2s in ha ■ 

3. The last two Approximate smoother error estimates are analogous to the Exact smoother error 

estimates 

\h {x) - {S^xh) {x)\ < mL,n {VpN + kG ihx,Kr) , 

and 

\Ry {x) - {S$,Ry) (x)| < (y^ + kc {hx,Kr)' , 

from 5.41 and 5.42. 
Type 2 error estimates (k > 1) 

Wc want to derive Type 2 error estimates for \fd (x) — Sa {x)\. In fact, these results can be proved by 
combining the Type 2 Exact smoother error estimates from Subsection 4.6.3 with the Type 2 estimates 
for |se (x) — Sa {x)\ from Subsubsection 5.6.3. 

Theorem 208 Suppose a weight function satisfies property W2 for some k > 1 and denote the basis 
function by G. Then the Approximate smoother error estimates 5.28 and 5.29 hold for 

CG = -^{V^G){0)d, s = l, /iG = 00, fcG = (27r)-V-(V2G) {0)Vd. 

However, if the weight function is radial we can use the estimates of: 

Theorem 209 Suppose a radial weight function satisfies property W2 for k — 1 and denote the (radial) 
basis function by G. Set r = \x\. Then: 

1. If G {x) = f (r^) the Approximate smoother error estimates of Theorem 206 hold for 

CG = -f'{0)d^ s = l, /iG = oo, fee = (27r)-^ V-2/' (0)^ 

2. If G {x) = g (r) then the Approximate smoother error estimates of Theorem 206 hold for 

CG = -\g"{0)d^ s = l, ha = 00, fee = (27r)-^ y^P^d 

If the weight function is a tensor product we can use the estimates of: 

Theorem 210 Suppose a tensor product weight function satisfies property W2 for k = 1 and denote the 
univariate basis function by G\ . Then the Approximate smoother error estimates of Theorem 206 hold 
when 

CG = -'^Gi{Qf-^D^G^{Q), s = l, he = 00, kG = {2n)-i ^f^G^A^f-^D^G^)Vd. 
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5.9.3 Estimates using unisolvent data sets 

The next result is an error estimate for an arbitrary data function in X° . 
Theorem 211 Suppose: 

1. w is a weight function with property W2 for some k> 1 and let G be the basis function generated 
by w. Set m= [kJ . 

2. Se is the Exact smoother generated by the data X; £xfd for some data function fa G X^, and X 
is contained in the data region fi. 



3. Sa is the Approximate smoother generated by 



X'l £xfd 



and the set X' ci^. 



4- We use the notation and assumptions of Lemma 72 which means here that X' and X are m- 
unisolvent and that fl is a bounded, open, connected set whose boundary satisfies the cone condition. 

Then there exist positive hn,m, CQ,m, K'q ^ and ka = ('^^Lii (cn.m)'" -f^n m \lfi^G (0)1 such that 

\fd {X) - Sa {X)\ < ^ifd-SeJd)^,o {K'^,mVp^ + kc (/iX.o)'") + 



+ ^J{s^-^sZse)^okG {hx',nT' , a; e O, 



when hx,K = sup dist (a;, X) < ha and hx',K = sup dist (a;, X') < ha- 



Further,^(se - Sa, Se)^,o < l|selL,o ^ ll/dlL,o and ^ {fa - Se, /d)^,o ^ ll/dlL,o- 
Proof. Prom Theorem 200 



\Se (x) - Sa{x)\< ^ (Se " Sa, Se)^,o^G (/iX'.o)™ , X G il, 

when hx',n < hn,m- From inequahty 4.31 

\fd (X) - Se {X)\ < ^(/d-Se,/dL,o [K^m\^ + ka (/iX.o)'") , X €n, 

when hx,n < hn^rn- Hence when .x G f2 

\fd {x) - Sa {x)\ < \ fd (x) - .Se (.x)| + |Se (x) - So (a;) | 

< ^Jifd-Se,fd)^,o (K'a,m^ + kG (/ix.o)") + 

+ y^(Se - Sa,Se)^^QkG (/iX'.o)" • 

The final inequaUties follow from Theorem 200 and Corollary 153. ■ 
Remeirk 212 We can combine the two density measures hx,n and hx',n into one measure 

ha = {{hx,nr + {hx-,nrf'"'. 



From Theorem 206 we have ^{fd - Se,/d)^,o ^ ll/dlL,o '^"^ \/i^e - Sa,Se)^,o ^ ll/dlL,o 



that 



\fd (X) - Sa {X)\ < ^{f,-SeJd)^^o (^0,™VM^ + (/^X.o)") + 

+ ^J{s^^^7^^^7e)~QkG {hx',n)"^ 

< \\fd\L,o {K'n,mVpN + kG {hx,nr + ka (/ix'.n)™) 

< \\fdL,0 {K'n^mVpN + kGihar), 

SO that when p — the order of convergence in ha is m — [kJ . 
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An important application of the Approximate smoother is the case where X' is a regular rectangular 
grid. To apply the last theorem we must show that if the grid size decreases then X' eventually contains 
a TO-unisolvcnt subset. The next theorem confirms this. The result of the next theorem is obvious in one- 
dimension but results for polynomials in higher dimensions can be complex and/or unexpected. However, 
here things are OK. We will need the following lemma: 

Lemma 213 We have the following unisolvency results: 

1. The 5ei {7 e Z'' : < 7 < n} is unisolvent w.r.t. Pn- 

2. Translations of minimal unisolvent sets are minimal unisolvent sets. 

3. Dilations of minimal unisolvent sets are minimal unisolvent sets. 

Proof. Part 1. Prom the definition of unisolvency, Definition 66, we must show that for each p e P„, 
P (7) ^ for < 7 < n implies p = 0. The proof will be by induction on the order of the polynomial. 
Clearly the lemma is true for n = 1 since Pi is the constant polynomials. 

Now assume that n > 2 and that if p G Pn and p (7) = for < 7 < n then p = 0. Set p (x) = 
J2 CfjX^ . Then if 7/j = and 7, = 1 when i ^ k then = J2 c/37'^ implies C/3 = when f3k = 0. 

\m<n \0\<n 

Thus p (7) = for < 7 < n implies C/j = when Pi = for some i. Consequently, p = if n < d else 
P{^) = Yl CfjX^ ■ If n > we can write p {x) = x^q {x) where q G Pn-i, so that g (7) = for 1 < 7 < n 

\0\<n 
f3>0 

must imply g = 0. Finally, if we define r G Pn-i hy r (x) = q {x + 1) then r (7) = for < 7 < n — 1 
must imply r = 0, and so the truth of our lemma for n — 1 implies the truth of the lemma for n, and the 
lemma is proved. 

Parts 2 and 3. From the definition of a cardinal basis. Definition 68, there is a unique cardinal basis 
{li} of Pn associated with a set A = {ui} iff A is minimally unisolvent of order n. Now by definition 

h (cij) = Sij. Hence, if t,S G R'' and 5 has positive components, then the cardinal basis associated with 
the translation ^ + t is {li (• — r)} and the cardinal basis associated with the dilation SA is {k {-/S)}. ■ 

Theorem 214 Suppose X' = {x'^ = a + ha \ a gZ'^ and < a < Af'} is the regular, rectangular grid 
introduced in Definition 170. 

Then X' is m-unisolvent if M' > m. 

Proof. Since {X' — a) /h = {a \ a G Z'^ and < a < A/"'} and Af' > m, part 1 of the lemma implies 
{X' — a) /h is m-unisolvent and thus from the definition of unisolvency. Definition 66, {X' — a) /h must 
contain a minimal unisolvent subset. Parts 2 and 3 of the lemma imply that X' contains a minimal 
unisolvent subset and so X' is unisolvent. ■ 

We can improve the last convergence result when the data functions have the form Ry. 

Theorem 215 Suppose: 

1. w is a weight function with property W2 for some k> 1 and let G be the basis function generated 
by w. Set m= [k\. 

2. S^Ry is the Exact smoother generated by the data X;£xRy for some data function Ry, and X 
is contained in the data region Cl. 



Sx^x'^y Approximate smoother generated by X;£xRy 

data region Cl. 



id the set X' is contained in the 



4. We use the notation and assumptions of Lemma 72 which means here that X' and X are m- 
unisolvent and that fl is a bounded, open, connected set whose boundary satisfies the cone condition. 

Then there exist positive hn,mi cn,m, K'q ^ and kc = ,^ Li-i (cn,m)™ -K^q m \D'^^G (0)1 such that 

' y^^l ' ' |/3|=m 

\Rv (x) - Sx,x'Ry (^)l < {K'a^„,^/pN + kc {hx,ar + ka {hx',^)^)^ , 
for x,y ^0,, when hx,n = sup dist (x, X) < he and hx',Q = sup dist (x, X') < he- 
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Proof. When hx,Q. < /if2,»n and hx'.n l£ ^J2,m Theorem 202 implies 

\S3,Ry {x) - SScx'Rv W I < k'a {hx'^nT (^n,™ Vp^ + kc {hx^nT + kc (/ix'.n)'") , 
for X gQ, and 5.42 implies that 

\Ry{x)-SS,Ry{x)\< (k'^^^^ + kG{hx,ar)\ x,y &Ti. 
Hence when a; e O 

\Ry {x) - S^^x'Ry (x) I < \Ry i^) - S^Ry {x)\ + \SI,Ry (x) - S%^x'Ry W | 

< (^Ki,^^^ + kG{hx,nry + 

k'c {hx',nr {K^^m^+kG (hx.nr + ka 

< (a:^_„v^+ /CG {hx,nr + ka (ftx'.n)™)' • 

■ 

Remeirk 216 We have 

\Ry (x) - S%^^,Ry {x)\ < (k'^^^^ + Ug (/ia)™)' , x,yeTl, 

where ha = {{hx,n)™ + (/ijc.o)™)^^™ so that when p = the order of convergence is 2m = 2 [kJ . This 
compares with order m = [k\ obtained in Remark 212 in the case of an arbitrary data function. 

5.10 Numerical experiments with scaled, extended B-splines 
(non-unisolvency) 

In this section the convergence of the Approximate smoother to its data function is studied numeri- 
cally using scaled, extended B-splines, the first of which will be the hat function. We will use the same 
1-dimcnsional data fimctions and B-splinc basis fimctions 4.37 that were used for the numerical inter- 
polation experiments described in Section 4.8. We will only consider the numerical experiments in one 
dimension to allow the data density parameter hx k = maxdist {x, X) to be easily calculated. 

It is easier to validate the non-unisolvent error estimates i.e. those derived without assuming unisolvent 
data sets. This is because in the non-unisolvent case the constants can be known precisely. In the case of 
unisolvent data the theory is complex and it is unclear what are suitable upper bounds for the constants 
^'Q,m' cn,m, hn^rn in Theorem 211. 

Because all the extended B-spline basis weight functions have a power of sin x in the denominator we 
will use the special classes of data functions developed in Section 2.6. The multivariate theory of local 
data spaces X° (fi) where is a bounded open set was developed in Sections 2.7 and 3.5. 

The 1-dimensional data set is constructed using a uniform distribution on the data region K = 
[-1.5,1.5]. 

We now assume that X' is a regular grid so that 

hx',K = (5.47) 

where N' >2 is the number of grid cells. 

Each of 20 data files is exponentially sampled using a multiplier of approximately 1.3 and then we plot 
logiQ hx,K against log^g.^ where N = \X\. It then seems quite reasonable to use a least-squares linear 
fit and in this case we obtain 

hx,K ^ SmN-°-^\ (5.48) 

For ease of calculation let 



hx,K = hxN-", hi = 3.09, a = 0.81. (5.49) 
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Noting the error estimates of Theorem 206 we use 5.49 to write hxK = ^-^^^ define the Exact 

smoother error estimates 

\U{x)-{S-xf<i){x)\<Ej,{p), xGK, (5.50) 



and 
by 



\Ry{x)-{SI,Ry){x)\<eN{p), x,y€K, (5.51) 



En {p) = min < 



ll/rflL,o2\ARo(0), ^ ^5_g2) 

ll/d|l<x>,K + ll/dlL,o(^G) (w) ' 7^ + 



+ 



ll/aL,ov/^min{l,i^}. 



and 



£jv (p) = r/e (p) + min <j i?o (0) min |l, M| , (5.53) 



where 

r?e(p)=min<^ i?o (0) , (5.54) 
i ||ii.lU,^ + i?o(0)min{l,M}. 

5.10.1 Extended B-splines with n — 1 
The case n = 1, Z = 1 

The l-dimensional hat weight function w\ is given by w\ (^) = ^j^rr ^md was discussed in Subsections 
1.2.1 and 1.2.6. This is a scaled, extended B-sphne basis function corresponding to n = i = 1 and thus 
Theorem 7 and part 6 Remark 2 imply k < 1/2. 

Now suppose n is the 1-dimcnsional unit- valued rectangular function with support [—.5, .5]. Then it 
was shown in Subsubscction 2.6.1 that: 

Theorem 217 If u £ (M^) then u*Il is a data function i.e. u* 11 e where wa is the hat weight 
function. Further, if we define V € Lj^^ fl Cgp by 

V{x)= [ u (t) dt, (5.55) 
Jo 

then DV = u a.e., 



ll«*n||^.,o = (27r)"^IHl2- 

To obtain our data function = u * 11 we will choose 

u{x) = e"'^^, 

for which 

y=(27r)^erf, = 2 (27r)^ ||u * n||^^ ^ = 2, (5.56) 

and 

fd{x)=u*Il (x) = erf (^+1)- erf - ^) • (5-57) 
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Since we are using the hat basis function, from Subsubsection 2.6.1 

G(0) = 1, Cg = 1, s = 1/2, /iG = oo, fee - (27r)-i/^ \/2, 
and from 5.56 and 5.57, 

ll/<ill,«,o = 2. 

Finally, Dfd (x) = {2tt)^ e~(^+^) - (27r)^ e"('^"5) = iff x = so the maximum value of the data 
function on the data region if = [—1.5, 1.5] is 



niax/d = erf 



erfl-- 



1.041. 



For the double rate convergence experiment we will use the Riesz data function Rq = (27r) ^ A. 
Numerical results 



Absolute smoother error vs. Smoothing parameter We start by plotting the absolute error bounds given 



by 5.52, 5.53 and 5.54, together with the actual absolute smoother errors, against the smoothing param- 
eter. To be precise the actual errors are averaged over a regular error grid covering the data region. The 
results for the data function fd are shown in Figure 5.1 and the results for the Riesz data function Rq 
are shown in Figure 5.2: 

Approx smoother error vs Smoothing parameter for various Nos of data points 




o 
i_ 
i_ 
a> 
i_ 

o 
o 
E 

0) 



Averaged actual errors 



Error bounds 



10 10 10 10 10 




Smoothing parameter 

FIGURE 5.1. Data function: erf + ^) — erf Num smoother grid cells is 16. 



Smoother error on Data region As the number of data points increases the smoother error of fd is 



observed to take the form of stable alternating positive/negative spikes as shown in Figure 5.3 which 
corresponds to 30K data points. The positive errors are modulated by a hat function and the zero error 
points have a period of 3/16; note there are 16 cells in the smoother grid. 
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Approx Smoother error vs Smoothing parameter for various Nos of data points 




Smoothing parameter 

FIGURE 5.2. Riesz data function: Rq = h. (Hat tunc). Num smoother grid ceUs is 16. 



It is seen in Figure 5.4 that as the number of points increases the smoother error of the Riesz data 
function _Ro is observed to stabihze to a wide negative central spike surrounded by smaUer spikes of 
about l/5th the amphtude. 

It is clear that, except for smoothing parameters larger than 10, the theoretical error bounds given in 
5.52 and 5.53 capture only a small part of the actual error. The convergence of the error bounds to the 
average spatial errors for large p is typical. The averaging process also disguises the instability indicated 
by Figure 5.5. 



Absolute smoother error vs. Data density Following the numerical work of Chapter 4 which studied 



the Exact smoother we will also apply the 'spike' filter to the Approximate smoother errors. This filter 
calculates the value below which 80% of the errors lie. The filter is designed to remove 'large', isolated 
spikes which dominate the actual smoother errors. The smoother error was calculated on a grid with 200 
cells applied to the domain of the data function. No filter is used for the first five interpolants because 
there is no instability for small numbers of data points. This filter will be used in all the cases below. 

We now move on to examine the relationship between the smoother error and the data density. To 
do this we need to fix a value for the smoothing coefficient and here we choose p = 10~^. We select 
this value because often the Exact smoother average error curve has a minimum near this value. Using 
the functions and parameters discussed at the end of the last subsection we obtain the four subplots 
displayed in Figure 5.5, each subplot being the superposition of 20 interpolants. 

The two upper subplots relate to the data function 5.57 and the lower subplots relate to the data 
function Rq. The right-hand subplots are filtered versions of the actual errors on the right. The (blue) 
points above the actual errors in Figure 5.5 are the estimated upper bounds for the error given by the 
theoretical estimates 5.44 and 5.45. The annotation at the bottom of the figure supplies the following 
additional information: 

Input parameters 

N = L = 1 - the hat function is a member of the family of scaled, extended splines with the indicated 
parameter values. 



xlO 
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Approximate smoother error: 30,000 data points 




Data region. 



FIGURE 5.3. Data function: erf (x- + ^) — erf [x — |). Number of smoother grid cells is 16 



spl scale 1/2 - the actual scaling of the spline basis function is 1/2 divided by spl scale, 
sm parm le-6 - the smoothing parameter is zero for interpolation. 

samp 20 - the sample size which is the number of test data files generated. The data function is 
evaluated on the data interval K — [—1.5, 1.5] at points selected using a uniform (statistical) distribution. 

pts 2:50K - specifics the smallest number of data points to be 2 and the largest number of data points 
to be 30,000. The other values are given in exponential steps with a multiplier of approximately 1.2. 

sm grid cell 16 - number of cells in the regular smoother grid superimposed on the data region K . 

Output parameters/messages 

No ill-condit - this indicates all Exact smoother matrices were always properly conditioned. 

Note that all the plots of error vs. data density given below have the same format and annotations. 

Clearly both theoretical error bounds substantially underestimates the convergence rate. The insta- 
bilities associated with small numbers of data points disappear as the number of data points increases. 
The smoother of i?o is less stable than that of fd- 



The case n — 1, I — 2 

Here we are interested in the 1-dimensional extended B-spline basis function with n = I and 1 = 2. The 
following result, which is Theorem 77 in one dimension, will allow us to generate data functions for which 
the X'^ norm can be calculated This result is closely related to the calculations done above for the hat 
function. 

Theorem 218 Data functions Suppose w is the B-spline tensor product weight function with parame- 
ters n and I and univariate 4-35, and that U (z (R"*^) with D^U G {^^) *^ sense of distributions. 
Then if we define the distribution 

fd = 6iU, ; = 1,2,3,..., 
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Approximate smoother error: 30,000 data points 




Data region 



FIGURE 5.4. Riesz data function: Ro = A (hat tunc). Number of smoother grid cells is 16. 



where S2 is the central difference operator 

d2U = U {■ + !)- U {■-!), 

it follows that fd G X^^ and 

||/,|L,o = 2'P"C/||,. 
Our tensor product basis function G has univariate 4.37 i.e. 

G(t)^(-l)'-",m^(i^^('-") ((*A)'))(|) 
= -l{D^K*K)){\), teRi. 



But 



(A * A) = A * D^A = A * ((5 (• + 1) - 2(5 + 5 (• - 1)) 

(A(. + 1)-2A + A(-~1)) 



so that 



and hence 



G{t) 



DG{t) 



16 



(A(i + 1)-2A(|)+A(A^1)) 



(A'(i + 1)-2A'(|)+A'(|-1)), 



(5.58) 
(5.59) 



i.e. \\DG\\ 



00 16 



27r. Since the (distributional) derivative is bounded the distributional Taylor series 



expansion of Lemma 42 can be used to write 

G{Q)-G{t)<\\DG\\^\t\, xeM\ 



5.10 Numerical experiments with scaled, extended B-splines (non-unisolvency) 



173 



Data fn: erf(x+1/2) - erf(x-1 
1 







12) 



Filtered error 



9 -1 



X _? 



o -3 
5 -4 
-5 




-4 -2 

Log10 data density 



Data fn: Rq(x) 




-4 -2 

Log10 data density 



1 


-1 
-2 
-3 
-4 
-5 




-4 -2 
Log10 data density 

Filtered error 




-4 -2 

Log10 data density 



dim 1, N=1, L=1, spl scale 1/2, sm parm 1e-6, samp 20, 
pts 2:50K, sm grid cell 16, no ill-condit 

FIGURE 5.5. Error of Approximate smoother vs data density. 



which means that 



G(0) = 



, Cg = \\DG,,2\\^ = ^V2^, s - i - oo. 



With reference to the last theorem we will choose the bell-shaped data function 
where 



U{x) 



^2 (e-fci,2^") (0) 2(l-e-4fei.2) 
so that ||/d||^^ = /d(0) = land 



0.3, 



l/dlL.o = 4|lDC/|l, 



1 - e-4fei,: 



(5.60) 



The error estimates are given by 5.44 and 5.45 where kc = (27r) \/2Cg and ||-Ro|loc k ~ -^o (0) 
(27r)"^G(0) = 1/4. 
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For the theory developed in the previous sections it was convenient to use the unsealed B-sphne weight 
function definition 4.35 but for computations it may be easier to use the unsealed version of the extended 
B-spline basis function given in the next theorem. 

Theorem 219 Suppose = (-1)'"" j^^^^y^ 

where A is the 1- dimensional hat function and 

n, I are integers such that 1 < n < I. 

Then for given A > 0, Gg (Xx) is called a scaled extended B-spline basis function. The corresponding 
weight function is wx (t) = 2Xaw (^) where w is the extended B-spline weight function with parameters 

n, I and a = 22fi-n)+i • Indeed, for wx we can choose k = n — 1. Further 

GsiO)-Gs{Xx)<X\\DGs\\^\x\, x€R\ (5.61) 
Finally, if fd G X° and ga (x) = fa (2Ax), it follows that gd € and ||ffdlL^,o = V« ll/dlL,o- 
Numerical results 



Smoother error on Data region: As the number of data points increases the error function of the 

smoother of fd stabiHzcs: it has alternating positive and negative spikes with an overall mean of zero. 
The zeros of the error function have a period of 3/16: note there are 16 cells in the smoother grid. 

As the number of data points increases the error of the smoother of the Riesz data function Rq is 
obscrvcxl to sta1)ilizc: it has a wide iK\c,ati\"c (;cntral spike at a; = surrounded by much smaller values. 



Absolute smoother error vs. Data density: As with the previous case we select p = 10 and, using 
the 'spike filter' of the previous case, we obtain the four subplots displayed in Figure 5.6, each subplot 
being the superposition of 20 smoothers. The two upper subplots relate to the data function 5.60 and 
the lower subplots relate to the Riesz data function Rq. 

The (blue) points above the actual errors in Figure 5.6 are the estimated upper bounds for the error 
given by the estimates 5.44 and 5.45. Clearly both theoretical bounds substantially underestimates the 
convergence rate. The instabilities associated with small numbers of data points eventually disappear as 
the number of data points increases. The smoother of Rq is less stable than that of fd- 

5.10.2 Extended B-splines with n = 2 

Since n > 2 we can use the error estimates of Theorem 210. We will use the same scaled (dilated) B-spline 
and data function that we used for interpolation in 2.6.2 and for Exact smoothing in Subsection 4.8.2. 

The case n — 2, I = 2 



The basis function is 

^ , , (A*A)(2x) 3v^,. ,^ , 
^^-^(^^= (A.A)(0) =^(A*A)(2x), 

with scaling factor A = 4 and is such that suppG2,2 = [—1, 1] and G2,2 (0) = 1. To calculate G2,2 we use 
the convenient formula 

G2,2 (x) = (1 + xf A {2x + 1) + (1 - 2a;2) A (2x) + (1 - xf A (2x - 1) , 

and choose the bell-shaped data function 

fd = 5lU e Xl, (5.62) 

where 



U{x) = -r-, — — — TT-^, fci2=0.3, (5.63) 

^' (5|(e-'=i.2^ ) (0) 2(l-e-4fci.2)' ' v y 



so that 



12 4/=^ [k 



-,3/4 



^^^ = 77^17?' II/'^IU = V727r-^i^^^, fci,2 = 0.3, s = I, /ig = oo. (5.64) 
(27r) i e • 
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For the double rate convergence experiments the data function is Rq = (27r) ^ G2,2 where Rq (0) = 
(27r)"i 

The next step is to calculate H/dHoo k ll-^o||oo k where K = [—1.5, 1.5]. Clearly H-RoHoo k — (0) 
and from 5.62 and 5.63, ||/d|U,K = U (0) = 1- Thus 

ll^olL,K = ^o(0) = (27r)-^ ||/dlL,K = l. (5.65) 

Numerical results 



Smoother error on Data region: As the number of data points increases the smoother error of fd is 
observed to consist of stable alternating positive and negative spikes with mean zero. The zeros of the 
error function have a period of 3/16: note there are 16 cells in the smoother grid. 

As the number of data points increases the smoother error of the Ricsz data fimction Rq is observed 
to stabilize: it has a wide negative central spike at .t = surrounded by much smaller values. 



Absolute smoother error vs. Data density: As in the previous cases we select p = 10 ^ and use the 
spike filter. The output consists of the four subplots displayed in Figure 5.7, each subplot being the 
superposition of 20 smoothers. The bottom subplots correspond to the Riesz data function. 

The (blue) points above the actual errors in Figure 5.7 are the theoretical upper bounds for the 
smoother error given by the estimates 5.44 and 5.45. 

Clearly for the data function fd the theoretical error bound substantially underestimates the conver- 
gence rate. Regarding the data function Rq, the theoretical upper error bound for the data function Rq 
is a rather better fit to the unfiltered errors. The instabilities associated with small numbers of data 
points eventually disappear as the number of data points increases. 

It is interesting to note that the case of 8 smoother grid cells shown in Figure 5.8 is more stable and 
converges to a much smaller error value than the case of 16 smooth grid cells in Figure 5.7. 
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5.11.1 The SmoothOperator software (freeware) 

In this section we discuss a numerical implementation for the construction and solution of the zero order 
Approximate smoother matrix equation 5.13 i.e. 

{NpRx',x' + R^x,x'Rx,x') ol = RJcx'V- 

I called this software SmoothOperator. In Subsection 5.5.2 it was shown that the construction and 
solution of this matrix equation is a scalable process and thus worthy of numerical implementation. This 
software also implements the positive order version of this equation which is derived in Williams [13] but 
the tutorials concentrate on the zero order basis functions. 

The algorithm has been implemented in Matlab 6.0 with a CUI interface but has only been tested 
on Windows. SmoothOperator can be obtained by emailing the author. However there is a short user 
document (4 pages) and the potential user can read this document first to decide whether they want 
the software. The top-level directory of the software contains the file read-me.txt which can also be 
downloaded separately. A full user manual (82 pages) comes with the software. The main features of 
the full user manual are: 

1 Tutorials and data experiments To learn about the system and the behavior of the algorithm, 

I have prepared three tutorials and five data experiments. 

2 Context-sensitive help Each dialog box incorporates context-sensitive help which is invoked using 
the right mouse button. An Fl key facility could be implemented. The actual help text is contained in 
the text file \Help\ContextHelpText.m that can be easily edited. 

3 Matlab diary facility When the system is started the Matlab diary facility is invoked. This means 
that most user information generated by SmoothOperator and written to the Matlab command line is 
also written to the text diary file. Each dialog box has a drop-down diary menu which allows the user to 
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view the diary file using Notepad. There is also a facility for you to choose another editor /browser. The 
file can also be emptied, if, for example, it gets too big. It can also be disabled. 

4 Tools for viewing data files Before generating a smoother you can view the contents of the data 
file. 

For ASCII delimited text data files, use the (slow) View records facility to display records and the 
file header, and then with this information you can use the high speed Study records facility to check 
the records and then obtain detailed information about single fields e.g. a histogram, and multiple fields 
e.g. correlation coefficients and scatter plots. 

For binary test data files only the View records facility is needed. The parameters which generated 
the file can also be viewed using the Make or View data option. 

5 The output data The output from the experiments and tutorials mentioned in point 1 above 
consists of well-documented Matlab one and two-dimensional plots, command line output and diary 
output. There is currently no file output, but this can be implemented on request. 

6 Reading delimited text files A MEX C file allows ASCII delimited text files to be read very 

quickly. You can specify: 

6.1 that the file be read in chunks of records, and not in one go. 

6.2 The ids of the fields to be read. This means that the file can contain non-numeric fields. 

6.3 Fields can be checked to ensure they are numeric. 

The tutorials which create smoothers allow the data, smoothed data, and related functions to be 
viewed using scatter plots and plots along lines and planes. 

7 Please note that there is no explicit suite of functions - application programmer interface or API - 
supplied by SmoothOperator for immediate use by the user. After refiecting on how I would create such 
an API, I decided that I lacked the experience to produce it, and that, anyhow, there were too many 
possibilities to anticipate. I would expect that possible users of this software, designed to be applied to 
perhaps millions of records, would need to familiarize themselves thoroughly with how this system works. 
I urge them to contact me and discuss their application. 

5.11.2 Algorithms 

The following three algorithms are used in the SmoothOperator software package to calculate the Ap- 
proximate smoother. 

Algorithm 1 uses data generated internally according to specifications supplied by the user using 
the interface. The smoothing parameter can be either specified or calculated using an error grid. This 
algorithm is scalable. 

Algorithm 2 uses a 'small' subset of actual data to get an idea of a suitable smoothing parameter to 
use for the full data set. This algorithm is not scalable. 

Algorithm 3 uses all the actual data. The smoothing parameter can be either specified or calculated 
using an error grid. This algorithm is scalable. 

We will now explain these algorithms in more detail. 

Algorithm 1: using experimental data 

1. Generate the experimental data [X, y]. The independent data X itcd by uniformly dis- 
tributed random numbers on X. The dependent data y is generated by uniformly perturbing an 
analytic data function gdat- 

2. Choose a smoothing grid X' whose boundary contains X. Choose an error grid X'^^^ which will be 
used to estimate the optimal smoothing parameter p. 

3. Read the data [X, y] and construct the matrix equation. If the matrix Gx,x' is dense, the matrix 
Gx,x'^x.x' is constructed using a Matlab mex file (a compiled C file). If Gx,x' is sparse the usual 
matrix multiplication is used. 

4. Given a value for the smoothing parameter p we can solve the matrix equation and evaluate the 
smoother. 
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We want to estimate the value of p which minimizes the 'sum of squares' error between the smoother 
and the data function 

h{p)= E {'yp{x")-9dat{x"))\ p>0. 

Empirical work indicates a standard shape for 5i (p), namely decreasing from right to left, reaching 
a minimum and then increasing at a decreasing rate. To find the minimum wc basically use the 
standard iterative algorithm of dividing and multiplying by a factor e.g. 10 and choosing the 
smallest value. The process is stopped when the percentage change of one or both of (p) and p 
are less than prescribed values. 

Algorithin 2: iisiiig a. ■test' suljstU, of actuul data 

1. Perhaps based on results using Algorithm 1, choose an initial value for smoothing parameter p and 
choose a smoothing grid X'. 

2. Step 2 of Algorithm 1. Denote the data by [X,y] where X = (a;^*^) and y = {yi). 

3. This is the same as step 4 of Algorithm 2 except we now minimize 

N 2 
i=l 

because we do not have a data function. 

Algorithm 3: using all the actual data 

1. Choose a value for smoothing parameter p, based on experiments using Algorithm 2. Choose a 
smoothing grid X'. 

2. Read all the data [X, y] and construct the matrix equation. If the matrix Gx,x' is dense, the matrix 
Gx x'^x,x' is constructed using a Matlab mex file (a compiled C file). If Gx,x' is sparse the usual 
multiplication is used. 

3. Solve the matrix equation to obtain the basis function coefficients and evaluate the smoother at 
the desired points. 

5.11.3 Features of the smoothing algorithm and its implementation 

1 The Short user manual and the User manual contains a lot of detail regarding the SmoothOperator 
system and algorithms. So we will content ourselves here with just some key points. 

2 Although the algorithm is scalable there can still be a problem with rapidly increasing memory usage 
as the grid size decreases and the dimension increases. The classical radial basis functions, such as the 
thin plate spline functions, have support everywhere. Hence the smoothing matrix is completely full. To 
significantly reduce this problem we do the following : 

a) We use basis functions with bounded support. 

b) We shrink the basis function support to the magnitude of the grid cells. This makes x'^x,x' 
a very sparse banded diagonal matrix, and Gx',x' has a small number of non-zero diagonals. We say 

this basis function has small support. 

3 Instead of using the memory devouring Matlab repmat function to calculate Gx,x' and Gx',x', we 
directly calculate the arguments of the Matlab function sparse. This function takes three arrays, namely 
the row ids, the column ids and the corresponding matrix elements, as well as the matrix dimensions, 
and converts them to the Matlab sparse internal representation. I must mention that although this is 
quick and space efficient, the algorithm is much more complicated than using repmat. 

Matlab's sparse multiplication facility is then used to quickly and efficiently calculate G^ x'^x,x'- 

4 This software has implemented the above techniques for the smoothing matrix, and a selection of 
basis functions is supplied. 
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5 The tutorials and exercises are based around the zero order tensor product hat (triangle) 
function, denoted by A. The hat function, also known as the triangle function, is used because of its 

simplicity and its analytic properties. The higher dimensional hat functions are defined as the tensor 
product of the one dimensional hat function : A (xi) = 1 — |a;i|, when \xi\ < 1, and zero otherwise. This 
function has zero order so that the smoothing matrix simplifies to 

*G = pNGx',x' + G^^x'Gx,x', Gx',x' = I, (5.66) 

with matrix equation 

*GQ! = Gx',xy- 

Smoothers constructed from the hat function are continuous but not smooth. I have also included the 
B-spline of order 3, namely A * A, which is a basis function of order zero or one. This can be used 
when a smoother is required to be smooth i.e. continuously differentiable. 

6 Note that this software was written before I embarked on the error analysis contained 
in this document so SmoothOperator concentrates on the Approximate smoother and the user cannot 
study the error of the Exact smoother or the Approximate smoother or compare the Approximate 
smoother with the Exact smoother. 

5.11.4 An application - predictive modelling of forest cover type 

In this section wc demonstrate how the method developed in the previous sections can be used in data 
mining for predictive modelling. This application smooths binary-valued data - I was unable to obtain 
a large 'continuous-valued' data set for distribution on the web. The source of our data is the web site 
file: 

http:/ /kdd. ics. uci. edu/ databases/ covertype/ covertype.html, 

in the UCI KDD Archive, Information and Computer Science, University of California, Urvine. 

The data gives the forest cover type in 30 x 30 meter cells as a function of the following cartographic 

parameters: 



Id 


Independent variable 


Description 


1 


ELEVATION 


Altitude above sea level 


2 


ASPECT 


Azimuth 


3 


SLOPE 


Inclination 


4 


HORIZ.HYDRO 


Horizontal distance to water 


5 


VERT.HYDRO 


Vertical distance to water 


6 


HORIZ.ROAD 


Horizontal distance to roadways 


7 


HILL_SHADE_9 


Hill shade at 9am 


8 


HILL_SHADE_12 


Hill shade at noon 


9 


HILL_SHADE_15 


Hill shade at 3pm 


10 


HORIZ.FIRE 


Horizontal distance to fire points 



TABLE 5.3. 



Forest cover type is the dependent variable y and it takes on one of seven values: 



Forest cover type 


Id 


Spruce fir 


1 


Lodge-pole pine 


2 


Ponderosa pine 


3 


Cottonwood/Willow 


4 


Aspen 


5 


Douglas fir 


6 


Krummholtz 


7 



TABLE 5.4. 
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The data file 

In this study we will use the data to train a model predicting on the presence or absence of the 
Ponderosa pine forest cover (id = 3), but the results will be similar for the other forest types. To 
this end wc have created from the full web site file a file called \UserData\forest_l_to_10_pondpin.dat 
which contains the ten independent variables of Table 5.3 and then a binary dependent variable derived 
from the variable Ponderosa pine of Table 5.4. This variable is 1 if the cover is Ponderosa pine and 
zero otherwise. 

Methodology 

We recommend you first use the user interface of the SmoothOperator software to construct artificial data 
sets to understand the behavior of the smoother and run the experiments to get a feel for the influence 

of the parameters. 

1 We chose a small subset of the forest-cover data and selected various smoothing parameters and 
smoothing grid sizes to study the performance of the smoother using plots and the value of the error. 

Note that two subsets of data could be used here, often called the training set and the test set. The 
training set would be used to calculate the smoother for the initial value of the smoothing coefficient, 
and then the test set could be used to determine the smoothing coefficient which minimizes the least 
squares error. 

2 Having chosen our parameters we run the smoother program on the full data set. 
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Data fniS^Cae"'"'') 



0-1 

X 
CO 



-3 



o 

a 
o 




-4 -2 
Log10 data density 



Data fn: R|j(x) 



0-1 

^ 2 
X 



o 

a 
o 



-3 

D)_4 

-5 
-6 




t -2 
Log 10 data density 



Filtered error 



1 

-1 
-2 
-3 
-4 
-5 
-6 



-4 -2 
Log10 data density 

Filtered error 




-4 -2 
Log10 data density 



dim 1, N=1, L=2, spl scale 1, sm parm 1e-6, samp 20, 
pts 2:50K, sm grid cell 16, no ill-condit 



FIGURE 5.6. Error of Approximate smoother vs data density. 
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dim 1, N=2, L=2, spl scale 1/4, sm parm 1e-6, samp 20, pts 2:50K, 
sm grid ceii 16, no iii-condit 



FIGURE 5.7. Error of Approx smoother vs data density: 16 smth grid cells. 
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Data fnia^Cae"""') 



Filtered error 



o 

X 
CD 

E 
o 



o 

X 
CD 

E 
o 



-6 



-4 




-2 
Log10 data density 



Data fn: R^{x) 




-2 
Log10 data density 




-2 
Log 10 data density 

Filtered error 




-4 -2 

Log 10 data density 



dim 1, N=2, L=2, spl scale 1/4, sm parm 1e-6, samp 20, 
pts 2:50K, sm grid cell 8, no ill-condit 



FIGURE 5.8. Error of Approx smoother vs data density: 8 smth grid ceUs. 



Appendix A 

Basic notation, definitions and symbols 



A.l Basic function spaces 

Definition 220 Basic function spaces 

All spaces below consist of complex-valued functions. 

• Pq = {0}. For n > 0, Pn denotes the polynomials of (total) order at most n i.e. degree at most 
n—1 when n > 1. These polynomials have the form ^ tia^"; where S C and ^ e M"^. 

\a\<n 

• C^°^ is the space of continuous functions. 

• is the space of bounded continuous functions. 

• space of continuous functions bounded by a polynomial. 

• C^rn) = {/ e (7(0) : g (7(0)^ ^/jen |a| = m). 

• C^^^ = {/ e Cf : e w/ien |q| < m}. 

• (7^™) = {/ G C'bp : -D"/ e Cfl, w/ien |a| < m}. 

m>0 TO>0 r7i>0 

• Co° is </ie space of C°° functions that have compact support. 
These are the test functions for the space of distributions. 

• S is the (7°° space of rapidly decreasing functions. These are the test functions for the tempered 
distributions of Definition 223 below. 

• For a bounded continuous function onW'-, \\f\\^ = sup |/(a;)|. 

• -^loc space of measurable functions which are absolutely integrable on any compact set i.e. 

any closed, hounded set. 

• is the complete normed vector space of measurable functions f such that J \f\ < oo. Norm is 
ll/lli = /l/l- 
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1 /2 

• LF' is the Hilbert space of measurable functions f such that J \ f\^ < oo. Norm is \\f\\2 = (/ l/l^) 
and inner product is (/, 5)2 = (/ fV)^^^- 

• L°° is the complete normed vector space of essentially bounded functions. The norm is \\f\\^ = 
essup|/(x)|. 



A. 2 Multi-index and vector notation 

Definition 221 Notation 

1. Multi-indexes are vectors with non-negative integer components. 

Let X = {xi,X2, ■■■,Xd) and y = (2/1,7/2, • • - .yd)- 

Suppose ~ is one of the binary operations <, <, =, >, >. 

Write (3~a if j3i~ Ui for all i. 

For X GM. write (3~x if Pi~x for all i. 

The component-wise product is denoted x.y = {xiyi). Component-wise division is denoted x/y = 
{x,/m) orf = (ft). 

Denote 1 = (1, 1, 1, . . . , 1) and let {ek}'l^i be the canonical (or standard) basis of 

d 

2. Denote \a.\ = ^ a,. The D°'f (x) is the derivative of the function f of degree a 

i=l 

D°f{x) = f{x), Dy{x)- 



D^'xiD^'x2...D^''xd 
3. 

monomial x" = x'^^x'^^ . . . x'^'', 



factorial a\ = ai\a2^- ■ ■ ■ a^! and 0! = 1, 



binomial (^^j = if f} < a. 



4. The inequality |a;"| < is used often. 

5. Important identities are 



|a|=fc |a|=fc 



\a\<k 



7. The binomial expansion is 



and a consequence is 
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S. A useful identity is 



9. Leibniz's rule is 



^ ^ fd + k-1 



\a\=k 



0<a 



A.3 Topology 

Definition 222 Topology on 

• The Euclidean norm is denoted by \x\ and the inner product by xy or {x,y). 

d 

Other norms are \x\-^ = J2 '^^'^ kloo ~ iiiax|a;i|. 

• The (open) ball B (a;; r) = {y :\x — y\ <r}. 

• e— neighborhood of a set: for e> Q the e— neighborhood of a set S is Sg = [j B {x; s). 

xes 

• R{a,b) = {x : a < x < b} will denote an open rectangle with left and right points a and b. 

• R[a,b) = {x : a < X < b} will denote the partially open rectangle with left and right points a and b. 

A. 4 Tempered distributions 

Definition 223 Tempered distributions (or generalized functions of slow growth) 

1. S is the space of rapidly decreasing C°° functions We endow S with the topology defined using the 
countable set of seminorms 

||(l + HrD"^||^, ^e5,n = 0,l,2,...; a > 0. 

2. S' denotes the space of tempered distributions or generalized functions of slow growth. It is the set 
of all continuous linear functionals on S under the seminorm topology of part 1. 

3. If f & S' and cf) € S then [/, (/)] G C will represent the action of f on the test function (p. The 
functions in S are called the test functions of S' . 

A. 5 Fourier Transforms 

Definition 224 Fourier and Inverse Fourier transforms on S and S'. 

1. This document uses the two Fourier transform notations 

F [/] = m = (27r)-'/' j e--«/(x)dx, f e S, 

SO that F : S ^ S. The inverse Fourier transform F~^ : S ^ S is 

if] = m = (27r)-'/' I e^^^f{x)dx, f e S. 
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2. The Fourier /Inverse Fourier transforms are extended to mappings from S' to S' by 

feS',<l>eS. 











"v 




v" 






f,4> 


1 






/,0 



Some important properties are: 

1- /(O = /(-O, /(O = /(O and /(O = / (-0- 

2. {f{x-a)f =e-^<J, aeM'*. 

3. (e»^)^ = (27r)''/^^(^ + a), a e M'^. 

4. (/) = i^. 

5. = i\°'\D°'f. 

6. = il"l^"/ = (z^)"/. 

7. r/ = (-*)'"' 

8. ?= (27r)-''/^ and T = (27r)''/^(5. 

9. = (27r)''/2 5 and D^S = {2t^)-'^'^ (-i)'"' 

10. If p is a polynomial then p = (27r)'^^^ p{iD)5 and p {D) f = p (— /• 

A. 6 Convolutions 

Definition 225 Convolution 

If f & and (f) G S then the convolution of f and <j) is denoted by f * cp or (p* f and is defined by 

(0 * /) (x) = {27,)-"/^ j f{x-y)cl> {y) dy = {21,)-"'^ j f(y)ct>{x- y) dy. 
This definition is extended to f G S' by the formulas 

f *cp={dfy = {2n)-''^^[fy, y)], <PeS. (A.2) 

A. 7 Taylor's expansion with integral remainder 

Suppose u € C(") {R'^) for some n>l. Then the Taylor series expansion about z is given by 

u{z + b)=^- (D^u) {z) + {UnU) [z, b) , 



where the integral remainder term 



{Tlnu) {z,b)=ny2 - t'^-\Df^u){z + {l-t) b)dt 
E {^-tT-\0<'n){z + tb)dt, 



l/3|=n 



satisfies the estimate (proved using A.l) 



\iJlnU)(zM< '^^^ max \D^u\ 

n! |/3|=n ' 

te[z,z+6] 



(A.3) 
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